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ALLERGENIC PROTEINS AND PEPTIDES 
FROM JAPANESE CEDAR POLLEN 

Background of the Invention 

Genetically predisposed individuals, who make up about 10% of the 
population, become hypersensitized (allergic) to antigens fix^m a variety of 
environmental sources to which they are closed. Those antigens that can induce 
immediate and/or delayed typ^ of hypersensitivity are known as allergens. (King, 
T,P,, Adv. Immmol 23: 77-105, (1976)). Anaphylaxis or atopy, which mcludes the 
syn^toms of hay fever, asthma, and hives, is one form of immediate allergy. It can 
be caused by a variety of atopic allergens, such as products of grasses, trees, weeds, 
animal dander, insects, food, drugs, and chemicals. 

The antibodies involved in atopic allergy belong primarily to the IgE class of * 
immunoglobulins. IgE binds to mast cells and basophils. Upon combination of a 
specific allergen with IgE bound to mast cells or basophils, the IgE may be cross- 
linked on the cell surface, resulting m the physiological effects of IgE-antigen 
interaction. These physiological effects inchide the release of, among other 
substances, histamine, serotonin, heparin, a chemotactic £actor for eosinophilic 
leukocytes and/or the leukotrienes, C4, D4, and E4, which cause prolonged 
constriction of bronchial smooth muscle cells (Hood, L.E. et al. Immunology (2nd 
ed.). The Benjamin/Cumming Publishing Co., Inc. (1984)). These released 
substances are the mediators which result in allergic symptoms caused by a 
combination of IgE with a specific allergen. Through them, the effects of an 
allergen are manifested. Such effects may be systemic or local in nature, dependmg 
on the route by which tiie antigen entered the body and the pattern of deposition of 
IgE on mast cells or basophils. Local manifestations generally occur on epithelial 
surfaces at the location at which the allergen entered the body. Systemic effects can 
include anaphylaxis (anaphylactic shock), which is the result of an IgE-basophil 
response to circulating (intravascular) antigen. 

Japanese cedar (Sugi; Cryptomeria japonica) pollinosis is one of the most 
important allergic diseases in J^an. The number of patients suffering from tiiis 
disease is on the increase arid in some areas, more than 10% of the population are 
affected. Treatment of j£9anese cedar pollinosis by administration of Japanese cedar 
pollen extract to effect hyposensitization to the allergen has been attempted. 
Hyposensitization using Japanese cedar pollen extract, however, has drawbacks in 
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that it can elicit anaphylaxis if high doses are used, whereas when low doses are used 
to avoid anaphylaxis, treatment must be contmued for several years to build up a 
tolerance for the extract. 

The major allergen firom Japanese cedar pollen has been purified and 
designated as Sugi basic protem (SBP) or Cryj 1. This protem is reported to be a 
basic protein with a molecular weight of 41-50 kDa and a pi of 8.8. There appear to 
be multiple isoforms of the allergen, apparently due in part to differential 
glycosylation (Yasueda et al. (1983) /. AUergy Clin. Immunol 71: 77-86; and Taniai 
et al. (1988) FEBS Letters 239: 329-332. The sequence of the fijrst twenty amino 
acids at die N-tenninal end of Cry j I and a sixteen amino acid internal sequence 
have been determined (Taniai supra) . 

A second allergen has recently been isolated from the pollen of Cryptomeria 
japonica (Japanese cedar) (Sakaguchi et al. (1990) Allergy 45:309-312), This • 
allergen, designated Cry j U, has been reported to have a molecular weight of 
approxnnately 37 kDa and 45 kDa when assayed on sodhmi dodecyl sulfate- 
polyacrylamide gel electrophoresis (SDS-PAGE) under non-reducmg and reducmg 
conditions, respectively (Sukaguchietal.,su^. CryyH was found to have no 
immunological cross-reactivity with Cryj I (Sakaguchi (1990) supra; Kawashima et 
al. (1992) /nr. Arch. Allergy Immunol 98:110-117). Most patients with Japanese 
cedar pollmosis were found to have IgE antibodies to both Cryj I and Cryj U. 
however, 29% of allergic patients had IgE that only reacted with Cryj I and 14% of 
allergic patients had IgE that only reacted with Cryj H (Sakaguchi (1990) supra). 
Isoelectric focusing of Cry j U mdicated that this protein has a pi above 9.5, as 
compared to pi 8.6-8.8 for Cry j I (Sakaguchi (1990) supra) . Further, the reported 
NH2-terminal sequence for Cry j H, NH2-AlaIleAsnIlePheAsnValGluLysTyr- 
COOH, did not match that reported for Cryj I (Sakaguchi (1990) supra). 

Despite the attention Japanese cedar pollinosis allergens have received, 
definition or characterization of the allergens responsible for its adverse effects on 
people is far from complete. Current desensitization therapy mvolves treatment with 
pollen extract with its attendant risks of anaphylaxis if high doses of pollen extract 
are administered, or long desensitization times when low doses of pollen extract are 
administered. 

r<i:iinimarv of the Invention 

The present invention provides nucleic acid sequences coding for the 
Cryptomeria japonica major pollen allergen Cry y II and fragments thereof. The 
present invention also provides purified Cry y II and at least one fragment thereof 
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produced in a host cell transfonned with a nucleic acid sequence coding for Cry j U 
or at least one fragment thereof and fragments of Cry j H prepared synthetically. 
As used herein, a fragment of the nucleic acid sequence coding for the entire amino 
acid sequence of Cry j n refers to a nucleotide sequence having fewer bases than the 
5 nucleotide sequence coding for the entire amino acid sequence of Cry j n and/or 
mature Cry j H. Cry j n and fragments thereof are useful for diagnosing, treating, 
and preventing Japanese cedar pollinosis. This invention is more particularly 
described in the appended claims and is described in its preferred embodiments in 
the following description. 

10 

Description of the Drawings 

Fig. la shows an SDS-PAGE (12%) analysis of Cry j n under non-reducing 
conditions. 

Fig. lb shows an SDS-PAGE (12%) analysis of Cry j U under reducing • 
15 conditions. 

Fig. 2 shows the results of mono S column chromatography of Cry j U eluted 
with a step gradient of NaCl in lOmM sodium acetate buffer, pH 5.0. 

Fig. 3 shows an SDS-PAGE (12%) of purified subfractions of Cry j n 
analyzed under reducing conditions. 
20 Fig. 4 shows the nucleic acid sequence (SEQ ID NO: 1) and the deduced 

amino acid (SEQ ID NO: 2) coding for Cry j n. 

Fig. S shows the deduced amino acid sequence of Cry j n (SEQ ID NO: 2). 
Fig. 6 shows the long form (SEQ ID NO: 4) and short form (SEQ ID NO: 5) 
NH2-tenninii amino acid sequences of Cry j n determined by protein sequence 
25 analysis as discussed in Exanq)le 2 aligned with the ten amino acid sequence of Cry 

j n (SEQ ID NO: 3) defined by Sakaguchi et al., supra (SEQ ID NO: 6). 

Fig. 7 is a gr^hic representation of the results of a direct ELISA assay 
showing the binding response of the monoclonal antibody 4B1 1 and seven patients* 
(Batch 1) plasma IgE to purified Cry y I as the coatmg antigen. 
30 Fig. 8 is a graphic representation of a direct ELISA assay showing the 

bmding response of the monoclonal antibody 4B11, and seven patients' (Batch 1) 
plasma IgE to purified native Cry y n as the coating antigen. 

Fig. 9 is a graphic representation of a direct ELISA assay showing the 
binding response of the monoclonal antibody, 4B11, and seven patients* (Batch 1) 
35 plasma IgE to recombinant Cry j n (rCry y II) as the coatmg antigen. 

Fig. 10 is a gr^hic representation of a direct ELISA assay showing the 
binding response of eight patients' (Batch 2) plasma IgE to purified native Cryj 1. 
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Fig. 11 is a graphic representation of a direct ELISA assay showing the 
binding response of eight patients' (Batch 2) plasma IgE to purified native Cryj U. 

Fig. 12 is a graphic representation of a direct ELISA assay showmg the 
binding response of eight patients* (Batch 2) plasma IgE to recombinant Cryj H. 
5 Fig. 13 is a graphic representation of a dhect ELISA assay showing die 

bmding response of eight patients' (Batch 3) plasma IgE to purified native Cryj 1. 

Fig. 14 is a graphic representation of a direct ELISA assay showing the 
bmding response of eight patients' (Batch 3) plasma IgE to purified native Cryj H. 

Fig. 15 is a graphic representation of a direct ELISA assay showing the 
10 bindmg response of eight patients' (Batch 3) plasma IgE to recombmant Cryj H. 

Fig. 16 is a table which summarizes both the MAST scores performed on 
patient's plasma samples (Batch 1-3) and the direct ELISA results shown in Figs. 7- 
15; a positive response is indicated by a (+) sign and the number of positive 
responses for each antigen is shown at the bottom of each colunm. 

15 

Dgtalled D escription of the Invention 

The present invention provides nucleic acid sequences coding for Cryj an 
allergen found in Japanese cedar pollen. The nucleic acid sequence coding for Cryj 
n shown m Fig, 4 (SEQ ID NO: 1) encodes a protein of 514 amino acids. The 

20 deduced Cry j U amino acid sequence is shown in Figs. 4 and 5 (SEQ ID NO: 2) . 
Direct protein sequence analysis of native purified Cryj n resulted in two separate 
overlapping NH2-termini sequences, designated Long and Short, corresponding 
respectively to amino acids 46 through 89 (SEQ ID NO: 4) and 51 through 89 (SEQ 
ID NO: 5) of Figs. 4, 5 and 6. The ten amino acid sequence NH2-AlaIIeAsnIlePhe- 

25 AsnValGluLysTry-COOH (SEQ ID NO: 6) previously defined by Sakaguchi et al, 
supra for Cry j U correspoi^ to ammo acids 55 through 64 of Figs. 4 and 6. The 
full-length Cry j H sequence contains 20 cysteine residues and three potential N- 
linked glycosylation sites with the consensus sequence of Asn-Xxx-Ser/Thr. 
According to the program contained in PC Gene, IntelUgenetics (Mountain View, 

30 CA) the proteins with the NH2-termini defined by the Long and Short forms of Cryj 
n would contain 469 and 464 amino acids, respectively, and have predicted 
molecular weights of 51.5 kDa Oong) and 50.9 kDa (short). The amino acid 
sequence representing the long form of Cry j VL is encoded by the nucleotide 
sequence extending from bases 177-1586 (SEQ ID NO: 7) as shown m Fig. 4, and 

35 the ammo acid sequence representing the short form of Cryj U is encoded by the 
nucleotide sequence extending from 192-1586 (SEQ ID NO: 8) as shown in Fig. 4. 
A host cell transformed with a vector contakdng the cDNA insert coding for full- 
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length Cry y n has been deposited with the American Type Culture Collection, 
ATCC No. 69105. 

Fragments of the nucleic acid sequence coding for fragments of Cry y n are 
also within die scope of the invention. Fragments within the scope of the invention 
5 include those coding for parts of Cry j U which induce an immune response in 
mammals, preferably humans, such as stimulation of minimal amounts of IgE; 
binding of IgE; eliciting the production of IgG and IgM antibodies; or the eliciting 
of a T cell response such as proliferation and/or lymphokine secretion and/or die 
induction of T cell anergy. The foregoing ftagments of Cryj II are referred to herein 

10 as antigenic fragments. Fragments within the scope of the invention also include 
those capable of hybridizing with nucleic acid from other plant species for use in 
screening protocols to detect allergens that are cross-reactive with Cry j n. As used 
herem, a fragment of the nucleic acid sequence coding for Cryj n refers to a 
nucleotide sequence having fewer bases than the nucleotide sequence coding for the 

15 enthe amino acid sequence of Cry j n and/or mature Cry j n. Generally, the nucleic 
acid sequence coding for the fragment or fragments of Cry j n will be selected from 
the bases coding for the mature protein, however, in some instances it may be 
desirable to select all or a part of a fragment or fragments fix>m the leader sequence 
portion of the nucleic acid sequence of the invention. The nucleic acid sequence of 

20 the invention may also contain linker sequences, modified restriction endonuclease 
sites and other sequences useful for cloning, e^^ression or purification of Cryj n or 
fragments thereof. 

A nucleic acid sequence coding for Cry j n may be obtained from 
Cryptomeria jcq?omcapl3sAs. Applicants have found that fiiesh pollen and staminate 

25 cones are a good source of Cry j n mRNA. It n:iay also be possible to obtain the 
nucleic acid sequence coding for Cry j U from genomic DNA. Cryptomeria 
japonica is a well-known species of cedar, and plant material may be obtained from 
wild, cultivated, or ornamental plants. The nucleic acid sequence coding for Cryj n 
noay be obtained using the mediod disclosed herein or any other suitable techniques 

30 for isolation and cloning of genes. The nucleic acid sequeiK:e of the invention may 
beDNAorRNA. 

The present invention provides expression vectors and host cells transformed 
to express the nucleic acid sequences of the invention. Nucleic acid coding for Cryj 
n, or at least one fragment thereof may be e;q)ressed in bacterial cells such as E. 
35 coUj insect cells (baculovirus), yeast, or mammalian cells such as Chinese hamster 
ovary cells (CHO). Suitable expression vectors, promoters, enhancers, and other 
expression control elements may be foimd in Sambrook et al. Molecular Cloning: A 
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Laboratory Manual, second edition. Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York (1989). Other suitable expression vectors, promoters, 
enhancers, and other expression elements are known to those skilled in the art. 
Expression m m^^^^^aiia", yeast or insect cells leads to partial or complete 
glycosylation of the recombinant material and formation of any inter- or intra-chain 
disulfide bonds. Suitable vectors for e)q)ression in yeast mclude YepSecl (Baldari et 
al. (1987) Embo 7. 6: 229-234); pMFa (Kurjan and Herskowitz (1982) Cell 30: 933- 
943); JRY88 (Schultz et al. (1987) Gene 54: 113-123) and pYES2 (Invitrogen 
Corporation, San Diego, CA). These vectors are freely avaUable. Baculovirus and 
fnammalian expression systems are also available. For example, a baculovirus 
system is commercially available (PharMingen, San Diego, CA) for expression in 
insect cells while the pMSG vector is commerically available (Pharmacia, 
Piscataway, NJ) for expression in m a mm a lian cells. 

For expression in E. coti, suitable expression vectors include, among others, 
pTRC (Amann et al. (1988) Gene 69: 301-315); pGEX (Amrad Corp., Melbourne, 
Australia); pMAL (N.E. Biolabs, Beverly, MA); pRTTS (Pharmacia, Piscataway, 
NJ); pET-lld (Novagen, Madison, WI) Jameel et al., (1990) /. Virol. 64:3963- 
3966; and pSEM (Knapp et al. (1990) BioTechniques 8: 280-281). The use of 
pTRC, and pET-lld, for example, will lead to the e3q)ression of unfiised protem. 
The use of pMAL, pRTTS pSEM and pGEX will lead to the expression of allergen 
fused to maltose E bindmg protein (pMAL), protein A (pRTTS), truncated 6- 
galactosidase (PSEM), or ghitathione S-transferase (pGEX). When Cryj U, 
fragment, or fiagments thereof is expressed as a fusion protein, it is particularly 
advantageous to mtroduce an enzymatic cleavage site at the fusion junction between 
the carrier protein and Cryj H or fragment thereof. Cryj n or fragment thereof 
may then be recovered horn the fusion protein through enzymatic cleavage at the 
enzymatic site and biochemical purification using conventional techniques for 
purification of proteins and peptides. Suitable enzymatic cleavage sites include those 
for blood clotting Factor Xa or thrombin for which the appropriate enzymes and 
protocols for cleavage are commercially available from for example Sigma Chemical 
Company, St. Louis, MO and N.E. Biolabs, Beverly, MA. The different vectors 
also have different promoter regions aDowing constitutive or inducible ejipression 
with, for example, IPTG induction (PRTC, Amann et al., (1988) supra; pET-lld, 
Novagen, Madison, WI) or temperature induction (pRTTS, Pharmacia, Piscataway, 
NJ) . It may also be appropriate to express recombinant Cry 7 H in different £. coli 
hosts that have an altered capacity to degrade recombinantly expressed proteins (e.g. 
U.S. patent 4,758,512). Alternatively, it may be advantageous to alter the nucleic 
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acid sequence to use codons preferentially utilized by E, coli, where such nucleic 
acid alteration would not affect the ammo acid sequence of the expressed protein. 

Host cells can be transformed to e)q)ress the nucleic acid sequences of the 
invention using conventional techniques such as calcixun phosphate or calcium 
5 chloride co-prccipitation, DEAE-dextran-mediated transfection, or electroporation. 
Suitable methods for transforming the host cells may be foxmd in Sambrook et al. 
supra , and other laboratoiy textbooks. The nucleic acid sequences of the mvention 
may also be synthesized using standard techniques. 

The present invention also provides a method of producing purified Japanese 

10 cedar pollen allergen Cry j Hot at least one fragment thereof conqirising the steps of 
culturing a host cell transformed with a DNA sequence encodmg J^anese cedar 
pollen allergen Cry / n or at least one fragment thereof m an appropriate medmm to 
produce a mixture of cells aiKi medium contaniing said Japanese cedar pollen 
allergen Cry j U or at least one fragment thereof; and purifying the nuxture to 

15 produce substantially pure Japanese cedar pollen allergen Cryj U or at least one 

fragment thereof. Host cells transformed with an expression vector containing DNA 
coding for Oy y n or at least one fragment thereof are cultured m a suitable medium 
for the host cell. Cry j U protem and peptides can be purified bom cell culture 
medhmi, host cells, or both using techniques known in the art for purifying peptides 

20 and proteins mcluding ion-exchange chromatography, gel filtration chromatography, 
ultrafiltration, electrophoresis and immunopurification widi antibodies specific for 
Cry j n or fragments ttiereof . The terms isolated and purified are used 
interchangeably herein and refer to peptides, protein, protein fragments, and nucleic 
acid sequences substantially free of cellular material or culture medium when 

25 produced by recombinant DNA techniques, or chemical precursors. 

Cry j n protein may also be isolated from j£9anese cedar pollen as described 
in Example 1. Oyy n isolated directly from Japanese cedar pollen is referred to 
herem as "purified native" Cry yn. It is preferable fliat purified native Cry yH of 
the mvention be at least 80% pure, and more preferably at least 90% pure and even 

30 more preferably be purified to homogeneity (at least 99% pure). 

Another aspect of the invention provides preparations comprising Japanese 
cedar pollen allergen Cry y U or at least one fragment thereof symhesized in a host 
cell transformed with a DNA sequence encoding all or a portion of J^anese cedar 
pollen allergen Cry y H, or chemically synthesized, and purified J^)anese cedar 

35 poUen allergen Cry y n protein, or at least one antigenic fragment thereof produced 
in a host cell transformed with a nucleic acid sequence of the invention, or 
chemically synthesized. In preferred embodiments of the invention the Cry y n 



wo 94/11512 



PCT/US93/11000 



8 

protein is produced in a host cell transformed with the nucleic acid sequence coding 
for at least the mature Cryj n protein. 

Fragments of an allergen from Cry j II, eliciting a desired antigenic response 
(referred to herein as antigenic firagments) are defined herein as any protein fragment 
or peptide which can be derived from tiie Cryj R proteins, but does not include the 
ten amino acid fragments which extends from anuno acid residues 55-64, as shown 
in Figs. 4, 5 and 6, but may include any portion of that ten amino acid fragment in 
conjunction with another fragment derived from Cryj H. Antigenic fragments of 
CryjU may be obtained, for example, by screening peptides recombinantiy 
produced from the corresponding fragment of the nucleic acid sequence of the 
invention coding for such peptides, or by screemng peptides which have been 
synthesized chemically using techniques known m the art, or by screening peptides 
produced by chemical cleavage of the allergen. The allergen may be arbitrarily 
divided into fragments of a desired length with no overlap of the pq>tides, or 
preferably divided into fragments of a desired length with no overlap of the peptides, 
or preferably divided into overlapping fragments of a desired length. The fragments 
are tested to determine then- antigenicity (e.g. the ability of the fragment to iiriuce 
an rnnnune response such as T cell proliferation as discussed in Example 7). 

Antigenic fragments may also be predicted using an algorithm such as that 
discussed m a paper by Hill et al, Jourml cf Immunology, 147:184-197 (1991). 
Algorithms for predictmg peptides which elicit T cell activity such as the algorithm 
discussed by Hill et al. are based on the protem's sequence wherein certam patterns 
within the sequence are likely to bmd MHC and therefore may contain T cell 
epitopes. The peptides predicted by the algorithm such as Cry j HA and Cry j IIB 
discussed in Example 7 may be produced recombinantiy or synthetically and tested 
for T cell activity as discussed in Exa^^)le 7. 

If fragments of J^anese cedar pollen allergen, e.g. Cryy n are to be used for 
therapeutic purposes, then the fragments of Japanese cedar pollen allergen which are 
cq>able of eliciting a T cell response such as stimulation (i.e., proliferation or 
lymphokine secretion) and/or are citable of indudng T cell anergy are particularly 
desirable and fragments of Japanese cedar pollen which have minimal IgE 
stimulating activity are also desirable. Additionally, for therapeutic purposes, 
pmified Japanese cedar pollen allergens, e.g. Cryj II, and fragments thereof 
preferably do not bind IgE specific for Japanese cedar pollen or bind such IgE to a 
substantially lesser extent than the purified native Japanese cedar pollen allergen 
binds such IgE. If the purified J2q)anese cedar pollen allergen or fragment or 
fragments thereof bind IgE, it is preferable that such binding does not result m the 
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release of mediators (e.g. histamines) firom mast cells or basophils. Minimal IgE 
stimulating activity refers to IgE stimulating activity that is less than the amount of 
IgE production stimulated by the native Cry j U protein. 

Isolated antigenic fragments or peptides of the present invention which have 
T cell stimulating activity, and dius comprise at least one T cell epitope are 
particularly desirable. T cell epitopes are believed to be involved in initiation and 
perpetuation of the imTnime response to a protein allergen which is responsible for 
the clinical symptoms of allergy. These T cell epitopes are thought to trigger early 
events at the level of the T he^r cell by binding to an ^ropriate HLA molecule 
on the surface of an antigen presenting cell and stimulating the relevant T cell 
subpopulation. These events lead to T cell proliferation, lymphokine secretion, local 
inflammatory reactions, recruitment of additional immune cells to the site, and 
activation of the B cell cascade leading to production of antibodies. One isotype of 
these antibodies, IgE, is fundamentally important to the development of allergic 
synq>toms and its production is influenced early in die cascade of events, at the level 
of the T beiper cell, by the nature of the lyn[q)hokmes secreted. An epitope is the 
basic element or smaUest unit of recognition by a receptor, particularly 
immunoglobulins, histocon^)atibility antigens and T cell receptors, where the epitope 
comprises amino acids essential to receptor recognition. Amino acid sequences 
which mimic those of the epitopes particularly T cell epitopes and which modify the 
allergic response to protein allergens including those enable of down regulating 
allergic response to Cry j n, are within the scope of this invention. 

As discussed in Example 7, human T cell stimulating activity can be tested by 
culturing T cells obtained from an todividual sensitive to J^anese cedar pollen 
allergen, (i.e., an individual who has an IgE mediated immune response to J^anese 
cedar pollen allergen) with a peptide derived from the allergen and determining 
whether proliferation of T cells occurs in response to the peptide as measured, e.g., 
by cellular uptake of tritiated thymidine. Stimulation indices for responses by T 
cells to peptides can be calculated as the maximum CPM in response to a peptide 
divided by the control CPM. A stimulation index (S.I.) equal to or greater than two 
times the background level is considered "positive**. Positive results are used to 
calculate the mean stimulation index for each peptide tested. Preferred peptides of 
this invention comprise at least one T cell epitope and have a mean T cell stimulation 
index of greater than or equal to 2.0. A peptide having a mean T cell stimulation 
index of greater than or equal to 2.0 is considered usefid as a tfaer^)eutic agent. As 
shown in Fig. 17 Cry j U peptides Cry j IIA and Cry j IIB have mean stimulation 
indexes of at least two and therefore comprise at least one T cell epitope as 
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predicted. 

Purified protein allergens from J^anese cedar pollen or preferred antigenic 
fragments thereof, when administered to a Japanese cedar pollen-sensitive individual, 
or an individual allergic to an allergen cross-reactive with Japanese cedar pollen 
5 aUergen, are capable of modifying the allergic response of the individual to Japanese 
cedar pollen or such cross-reactive allergen of the individual, and preferably are 
capable of modifying the B-cell response, T-cell response or both the B-cell and the 
T-cell response of the individual to the allergen. As used herein, modification of 
the allergic response of an individual sensitive to a Japanese cedar pollen allergen 

10 can be defined as non-responsiveness or diminution in symptoms to the allergen, as 
determined by standard clinical procedures (See e.g. Vamey et al, British Medical 
Journal, 3^:265-269 (1990)) includmg diminution in J^anese cedar pollen 
induced asthmatic symptoms. As referred to herein, a diminution in symptoms 
includes any reduction in allergic response of an individual to the allergen after the 

15 individual has completed a treatment regimen with a peptide or protein of the 
invention. This diminution may be subjective (i.e. the patient feels more 
comfortable m the presence of the allergen). Diminution in symptoms can be 
detennii^ clinically as well, using standard skin tests as is known in the art. 

The purified Cry j U protein or fragments thereof are preferably tested in 

20 mamTnalifln modcls of Japanese cedar poUinosis such as the mouse model disclosed 
m Tamura et al. (1986) Microbiol. Immunol 30: 883-896, or U.S. patent 
4,939,239; or the primate model disclosed in CM)a et al. (1990) Im. Arch. Allergy 
Immunol 93: 83-88. Initial screening for IgE bindmg to the protein or firagments 
thereof may be performed by scratch tests or intradermal skin tests on laboratory 

25 animals or human volunteers, or in in vitro systems such as RAST 

(radioaUergosorbent test), RAST inhibition, EUSA assay, radioimmunoassay (RIA), 
or hiR tamine release. 

Exposure of allergic individuals to purified protein allergens of the present 
invention or to the antigenic fragments of the present invention which comprise at 

30 least one T cell epitope and are derived from protein allergens may tolerize or 

anergize ^ropriate T cell subpopulations such that they become unresponsive to the 
protem allergen and do not participate in stramlating an hnnmne response iqjon such 
exposure. In addition, administration of the protein allergen of the invention or an 
antigenic fiagment of the present invention which comprises at least one T cell 

35 epitope may modify the lymphokine secretion profile as compared with exposure to 
the naturally-occurring protein allergen or portion thereof (e.g. result in a decrease 
of YLrA and/or an increase m IL-2). Furthermore, exposure to such antigenic 
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fragment or protein allergen may influence T cell subpopulations which normally 
participate in the response to the allergen such that these T cells are drawn away 
from the site(s) of normal exposure to the allergen (e.g., nasal mucosa, skin, and 
lung) towards the site(s) of therapeutic administration of the fragment or protein 
allergen. This redistribution of T cell subpopulations may ameliorate or reduce the 
ability of an individual's immune system to stimulate the usual inmiune response at 
the site of normal exposure to the allergen, resulting in a dimunution in allergic 
symptoms. 

The isolated Cry j U protem, and fragments or portions derived therefrom can 
be used m methods of diagnosing, treating and preventing allergic reactions to 
Japanese cedar pollen allergen or a cross reactive protein allergen. Thus the present 
invention provides therapeutic compositions comprismg purified Japanese cedar 
pollen allergen Cry j n or at least one fragment thereof produced in a host cell 
transformed to express Cry j U or at least one fragment thereof, and a 
pharmaceutically acceptable carrier or diluent. The therapeutic compositions of the * 
invention may also comprise synthetically prepared Cry j n or at least one fragment 
thereof and a pharmaceutically acceptable carrier or diluent. Administration of the 
therapeutic conq>ositions of the present invention to an individual to be desensitized 
can be carried out using known techniques. Cry j n protein or at least one fragment 
tiiereof may be administered to an individual in combmation with, for example, an 
appropriate diluent, a carrier and/or an adjuvant. Pharmaceutically accq)table 
diluents include saline and aqueous buffer solutions. Pharmaceutically acceptable 
carriers include polyethylene glycol (Wie et al. (1981) Int. Arch. Allergy Appl. 
Immunol. 64*84-99) and liposomes (Strejan et al. (1984) /. Neuroimmunol 7: 27). 
For purposes of inducmg T cell anergy, the ther^)eutic composition is preferably 
administered in nonimmimogenic form, e.g. it does not contain adjuvant. Such 
compositions will generally be administered by injection (subcutaneous, intravenous, 
etc.), oral administration, inhalation, transdermal plication or rectal 
administration. The therapeutic compositions of the invention are administered to 
J^anese cedar pollen-sensitive individuals at dosages and for lengths of time 
effective to reduce sensitivity (i.e, reduce the allergic response) of the uidividual to 
Japanese cedar pollen. Effective amounts of the fliera^jeutic compositions wiU vary 
according to factors such as the degree of sensitivity of the individual to Japanese 
cedar pollen, the age, sex, and weight of the individual, and the ability of the Cryj 
U protein or fragment thereof to elicit an antigenic response in the individual. 

The Cry j U cDNA (or the mRN A from which it was transcribed) or a 
portion thereof can be used to identify similar sequences in any variety or type of 
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plant and thus, to identify or "pull out" sequences which have sufficient homology to 
hybridize to the Cry j U cDNA or mRNA or portion thereof, for exanq)le, DNA 
from allergens of Cupressus sempervirens, Jwuperus sabinoides etc. , under 
conditions of low stringency. Those sequences which have sufficient homology 
(generally greater than 40%) can be selected for further assessment using the method 
described herem. Alternatively, high stringency conditions can be used. In fliis 
manner, DNA of the present mvention can be used to identify, m other types of 
plants, preferably related families, genera, or species such as Juniperus, or 
Cupressus^ sequences encoding polypeptides having amino acid sequences similar to 
that of Japanese cedar pollen allergen Cry j n, and thus to identify allergens m other 
species. Thus, the present invention includes not only Cryj n, but also other 
allergens encoded by DNA which hybridizes to DNA of the present mvention. The 
mvention further includes previously unidentified isolated allergenic proteins or 
fragments thereof that are nnmunologically related to Cryj n or fragments thereof, 
such as by antibody cross-reactivity wherein the isolated allergenic proteins or 
fragments thereof are capable of binding to antibodies specific for the protein and 
peptides of the invention, or by T cell cross-reactivity wherein the isolated 
allergenic proteins or fragments thereof are capable of stimulating T cells specific for 
the protein and peptides of this invention. 

Proteins or peptides encoded by the cDNA of the present invention can be 
used, for example as "purified" allergens. Such purified allergens are usefiil in the 
standardization of allergen extracts which are key reagents for the diagixosis and 
treatment of J^anese cedar pollinosis. Furthermore, by using peptides based on the 
nucleic acid sequences of Cry j n, anti-peptide antisera or monoclonal antibodies can 
be made using standard methods. These sera or monoclonal antibodies can be used 
to standardize allergen extracts. 

Through use of the peptides and protein of the present invention, preparations 
of consistent, well-defined composition and biological activity can be made and 
administered for then^utic purposes (e.g. to modify the allergic response of a 
J^anese cedar sensitive individual to pollen of such trees). Administration of such 
peptides or protein may, for example, modify B-cell response to Cry j U allergen, 
modify T-cell response to Cry jU allergen or modify both B-cell and T-cell 
responses. Purified peptides can also be used to study the mechanism of 
inununotherapy of Cryptomeria jcporuca aUergy and to design modified derivatives 
or analogues useful in immunotherapy. 

Work by others has shown that high doses of allergens generally produce the 
best results (i.e., best symptom relief). However, many people are unable to 
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tolerate large doses of allergens because of allergic reactions to the allergens. 
Modification of naturally-occurring allergens can be designed in such a manner that 
modified peptides or modified allergens which have the same or enhanced 
therapeutic properties as the corresponding naturally-occurring allergen but have 
reduced side effects (especially anaphylactic reactions) can be produced. These can 
be, for exanq)le, a protein or peptide of the present invention (e.g. , one havmg all or 
a portion of the amino acid sequence of Cryj E), or a modified protein or peptide, 
or protein or peptide analogue. 

It is possible to modify the structure of a protein or peptide of the invention 
for such purposes as increasing solubility, enhancing therapeutic or preventive 
efficacy, or stability (e.g., shelf life m vivo , and resistance to proteolytic 
degradation in vivo) . A modified protem or peptide can be produced in which the 
amino acid sequence has been altered, such as by amino acid substitution, deletion, 
or addition, to modify immunogenicity and/or reduce allergenicity, or to which a 
component has been added for the same purpose. For example, the amino acid 
residues essential to T cell epitope function can be determined usmg known 
techniques (e.g., substitution of each residue and d^rmination of the presence or 
absence of T cell reactivity). 

For exanq)le, a pq)tide can be modified so that it maintaias the ability 
to mduce T cell anergy and biirf MHC protems without the ability to induce a strong 
proliferative response or possibly any proliferative response when administered m 
immunogenic form. In this histance, critical binding residues for the T ceU receptor 
can be determined using known techniques (e.g., substitution of each residue and 
determination of the presence or absence of T cell reactivity). Those residues shown 
to be essential to interact with the T cell recq)tor can be modified by replacing the 
essential amino acid with another, preferably similar amino acid residue (a 
conservative substitution) whose presence is shown to enhance, diminish but not 
eliminate binding to relevant MHC. 

Additionally, peptides of the invention can be modified by replacing an 
amino acid shown to be essential to interact with the MHC protein complex with 
another, preferably similar amino acid residue (conservative substitution) whose 
presence is shown to enhance, diminish but not eliminate or not effect T cell 
activity. In addition, amino acid residues which are not essential for interaction with 
die MHC protem complex but which still bind the MHC protein complex can be 
modified by being replaced by another amino acid whose incorporation may 
enhance, not effea, or diminish but not eliminate T cell reactivity. Preferred amino 
acid substitutions for non-essential amino acids include, but are not limited to 
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substitutions with alanine, glutamic acid, or a methyl amino acid. 

Another example of a modification of protein or peptides is substitution of 
cysteine residues preferably with alanine, serine, threonine, leucine or ghitamic acid 
to minimir^ dimerization via disulfide linkages. Another example of modification of 
the peptides of the invention is by chemical modification of amino acid side chains 
or cyclization of the peptide. 

In order to enhance stability and/or reactivity, the protein or peptides of the 
invention can also be modified to mcorporate one or more polymorphisms in the 
amino acid sequence of the protein allergen resultmg from natural allelic variation. 
Additionally, D-amino acids, non-natural amino acids or non-amino acid analogues 
can be substituted or added to produce a modified protein or pq)tide within the scope 
of this invention. Furthermore, proteins or peptides of the present invention can be 
modified using the polyethylene glycol (PEG) method of A. Sehon and co-workers 
(Wie et al. supra) to produce a protein or pq)tide conjugated with PEG. In addition, 
PEG can be added during chemical synthesis of a protem or peptide of the invention. 
Modifications of proteins or peptides or portions thereof can also include reduction/ 
alyklation (Tarr in: Methods of Protein Microcharacterizption, I.E. Silver ed. 
Humana Press. Clifton, NJ, pp 155-194 (1986)); acylation (Tarr, supra) : chemical 
coupling to an appropriate carrier (Mishell and Shiigi, eds. Selected Methods in 
Cellular Immunology, WH Freeman, San Francisco, CA (1980); U.S. Patent 
4,939,239; or mild formalin treatment (Marsh International Archives of Allergy and 
Applied Immunology, 41:199-215 (1971)). 

To facilitate purification and potentially increase solubility of proteins or 
peptides of the invention, it is possible to add reporter groiqKs) to the peptide 
backbone. For exanq>le, poly-histidine can be added to a p^tide to purify the 
peptide on inamobilized metal ion affinity chromatography (Hochuli, E. et al., 
Bio/Technology, 6:1321-1325 (1988)). In addition, specific endoprotease cleavage 
sites can be introduced, if desired, between a reporter group and amino acid 
sequences of a peptide to facilitate isolation of peptides firee of krelevant sequences. 
In order to successfully desensitize an individual to a protein antigen, it may be 
necessary to increase the solubility of a protein or peptide by adding functional 
groups to the peptide or by not including hydrophobic T cell epitopes or regions 
containing hydrophobic ^itopes in the peptides or hydrophobic regions of the 
protein or peptide. 

To potentially aid proper antigen processing of T cell epitopes within a 
peptide, canonical protease sensitive sites can be recombinanfly or synthetically 
engineered between regions, each comprising at least one T cell epitope. For 
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exanq)le, charged amino acid pairs, such as KK or RR, can be introduced between 
regions within a peptide during recombinant construction of the peptide. The 
resulting peptide can be rendered sensitive to cathepsm and/or other trypsin-like 
enzymes cleavage to generate portions of the peptide containing one or more T cell 

5 epitopes. In addition, such charged anuno acid residues can result in an increase in 
solubility of a peptide. 

Site-directed mutagenesis of DN A encoding a peptide or protein of the 
invention (e.g. Cry y n or a fragment thereof) can be used to modify the structure of 
the peptide or protein by methods known in the art. Such methods may, among 

10 others, iiK:tude PGR with degenerate oligonucleotides (Ho et al.. Gene, 77:51-59 

(1989)) or total synthesis of mutated genes (Hostomslg^, Z. et al., Biochem. Biophys, 
Res, Cb/nm., 161: 1056-1063 (1989)). To enhance bacterial expression, the 
aforementioned methods can be used in conjunction with other procedures to change 
the eucaiyotic codons in DNA constructs encoding protein or peptides of the 

15 invention to ones preferentially used in E. coli, yeast, m a mm a li a n cells, or other 
eukaiyotic cells. 

Using the structural information now available, it is possible to design Cryj 
n peptides which, when administered to a Jsq^anese cedar pollen sensitive individual 
in sufficient quantities, will modify the individual's allergic response to Japanese 

20 cedar pollen. This can be done, for example, by examining the structure of Cryj n, 
producing peptides (via an expression system, synthetically or otherwise) to be 
examined for then: ability to infhience B-cell and/or T-cell responses in Japanese 
cedar pollen sensitive individuals and selecting appropriate peptides which contain 
epitopes recognized by the cells. It is now also possible to design an agent or a drug 

25 C2q>able of blocking or inhibiting the ability of Japanese cedar pollen allergen to 
induce an allergic reaction in Japanese cedar pollen sensitive individuals. Such 
agents could be designed, for exan^)le, in such a manner that they would bind to 
relevant anti-Cry j U IgEs, thus preventing IgE-aDergen bmding and subsequent mast 
cell degranulation. Alternatively, such agents could bind to cellular components of 

30 the iniTTiuTie system, resulting in suppression or desensitization of the allergic 

response to Cryptomeria japonica pollen allergens. A non-restrictive example of 
this is the use of appropriate B- and T-<:ell epitope peptides, or modifications 
thereof, based on the cDNA/protein structures of the present invention to siq>press 
the allergic response to Japanese cedar pollen. This can be carried out by defining 

35 the structures of B- and T-cell epitope peptides which affect B- and T-cell function in 
in vitro studies with blood components from Japanese cedar pollen sensitive 
individuals. 



wo 94/1 1512 



PCT/US93/11000 



16 

Protein, peptides or antibodies of the present invention can also be used for 
detecting and diagnosing Japanese cedar pollinosis. For example, this could be done 
by combining blood or blood products obtained from an individual to be assessed for 
sensitivity to Japanese cedar pollen with an isolated antigenic peptide or peptides of 
Cry j n, or isolated Cry j II protein, under conditions appropriate for binding of 
components in the blood (e.g., antibodies, T-cells, B-cells) with the peptide(s) or 
protein and determining the extent to which such binding occurs. Other diagnostic 
methods for allergic diseases which the protein, peptides or antibodies of the present 
invention can be used include radio-allergosorbent test (RAST), paper 
radiohnmunosorbent test (PRIST), enzyme linked nnmunosorbent assay (EUSA), 
radioimmunoassays (RIA), immuno-radiometric assays (IRMA), luminescence 
immunoassays (UA), histamine release assays and IgE immunoblots. 

In another diagnostic test, the presence in individuals of IgE specific 
for Cry j II at least one protein allergen and the ability of T cells of the individuals to 
respond to T cell epitope(s) of Cry j n protein allergen can be determined by 
administering to the individuals an Immediate Type Hypersensitivity test and a 
Delayed Type Hypersensitivity test. The individuals are administered an Immediate 
Type Hypersensitivity test (see e.g. Inummology (1985) Roitt, I.M., Brostoff, J., 
Male, D.K. (eds), C.V. Mosby Co., Gower Medical Publishing, London, NY, pp. 
19,2-19. 18; pp. 22. 1-22. 10) utilizing the Cry i II protem allergen or a portion 
thereof, or a modified form of the Cry y 27 protein allergen or a portion thereof, 
each of which binds IgE specific for the allergen. The same individuals are 
administeied a Delayed Type Hypersensitivity test prior to, simultaneously with, or 
subsequent to administraiton of the Immediate Type Hypersensitivity test. Of 
course, if the Immediate Type Hypersensitivity test is administered prior to the 
Delayed Type Hypersensitivity test, the Delayed Type Hypersensitivity test would be 
given to those individuals exhibiting a specific Immediate Type Hypersensitivity 
reaction. The Delayed Type Hypersensitivity test utilizes a modified form of the 
protem allergen or a portion thereof, the protein allergen produced recombinantly, or 
a recombitope peptide derived from the protein allergen, each of which has human T 
cell stimulating activity and each of which does not bind IgE specific for the allergen 
in a substantial percentage of the popiilation of individuals sensitive to the allergen 
(e.g., at least about 75%). Based on the results of the above diagnostic tests, those 
individuals found to have both a specific Immediate Type Hypersensitivity reaction 
and a specific Delayed Type Hypersensitivity reaction are suitable candidates for 
administration of a therapeutically effective amount of a therapeutic composition. 
The therapeutic composition comprises the modified form of the protein or portion 
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* 

thereof, the recombmantly produced protein allergen, or the recombitope peptide, 
each as used in the Delayed Type Hypersensitivity test, and a pharmaceutically 
acceptable carrier or diluent. 

The present invention also provides a method of producing Cryj n or 
fragment thereof comprising culturing a host cell containing an expression vector 
which contains DNA encoding all or at least one fragment of Cryj II under 
conditions appropriate for e^ipression of Cryj n or at least one fragment. Hie 
expressed product is then recovered, usmg known techniques. Alternatively, Cryj n 
or fragment th^eof can be synthesized usmg known mechanical or chemical 
techniques. 

The DNA used in any embodiment of this invention can be cDNA obtained 
as described herein, or alternatively, can be any oligodeoxymicleotide sequence 
having all or a portion of a sequence represented herem, or their functional 
equivalents. Such oligodeoxynucleotide sequences can be produced chemically or 
enzymatically, using known techniques. A functional equivalent of an 
oligonucleotide sequence is one which is 1) a sequence capable of hybridizing to a 
conq)lementary oligonucleotide to which the sequence (or corresponding sequence 
portions) of Cry j n or fragments thereof hybridizes, or 2) the sequence (or 
corresponding sequence portion) conq>lementary to Cryj n, and/or 3) a sequence 
which encodes a product (e.g. , a polypeptide or peptide) having the same functional 
characteristics of the product encoded by the sequence (or corresponding sequence 
portion) of Cry j H. Whether a functional equivalent must meet one or both criteria 
will depend on its use (e.g., if it is to be used only as an oligoprobe, it need meet 
only the first or second criteria and if it is to be used to produce a Cryj n allergen, 
it need only meet the third criterion). 

The invention is further illustrated by the following non-lnnitmg examples. 

Exanq>lel 

Purification of Native Japanese Cedar Pollen Allergen f Ov / ID 

The following purification of native Cry j U from Japanese cedar pollen was 
modified from previously published reports (Yasueda et al, /. Allergy CUn. 
Immunol. 71:77 (1983); Sukaguchi et al.. Allergy, 45:309 (1990)). 

lOOg of Japanese cedar pollen obtained from Japan (Hollister-Stier, Spokane, 
WA) was defatted in IL diethyl ether three times, the pollen was collected after 
filtration and the ether was dried off in a vacuum. 

The defatted pollen was extracted at 4^0 overnight in 2L extraction buffer 
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containing 50 mM tris-HQ, pH 7.8, 0.2 M NaCl and protease inhibitors in final 
concentrations: soybean trypsin inhibitor (2 ng/mL), leupeptin (1 ng/mL), pepstatin 
A (1 jig/mL) and phenyl methyl sulfox^l fluoride (0. 17 mg/mL). The msohible 
material was re-extrated with 1 .2L extraction buffer at 4^C overnight and both 
5 extracts were combined together and depigmented by batch absorption with 
Whatman DE-52 (200g dry weight) equilibrated with the extraction buffer. 

The depigmented material was then ftactionated by ammonium sulfate 
precipitation at 80% saturation (40C), which removed much of the lower molecular 
weight material. The resulting pellet was resuspended in 0.4 L of 50 mM Na- 
10 acetate, pH 5.0 containing protease inhibitors and was dialyzed extensively against 
the same buffer. 

The sample was furflier subjected to purification by either one of the two 
methods described below. 

15 Method A 

The sample was applied to a 100 mL DEAE cellulose column (Whatman DE- 
52) equilibrated at 4^C with 50 mM Na-acetate, pH 5.0 widi protease inhibitors. 
The unbound material (basic proteins) ftom flie DEAE cellulose column was then 
applied to a 50 ml cation exchange column (Whatman CM-52) which was 

20 equilibrated with 10 mM Na-acetate. pH 5.0 at 4^C with protease inhibitors. A 

linear gradient of 0-0.3 MNaa was used to elute the protems. The early fractions 
were enriched m Cry y I whereas the later fractions were enriched in Cryj H. 
Fractions containmg Cry j JL were pooled and next applied to an 1 mL Mono S HR 
5/5 column (Pharmacia, Piscataway, NJ) in 10 mM Na-acetate, pH 5.0, and proteins 

25 wereeluted with a Imear gradient of NaCl at room tenq>erature. Residual Cry/ 1 
was ehited at "0.2 M NaCl and Cry y n was ehited between 0,3 to 0.4 M NaCl. 
The Cryj n peak was pooled and concentrated to twofold by lyophilization and 
subjected to gel filtration chromatogr^hy. 

The sample was ^lied to FPLC Superdex 75 16/60 column (Pharmacia, 

30 Piscataway, NJ) in 10 mM acetate buffer, pH 5.0 and 0.15 M NaCl at a flow rate of 
30 ml/min. at room tenq)erature. Purified Cryj U was recovered m the 35-30 kD 
region. CryjU migrated as two broad bands lower than Cryj I under non-reducmg 
conditions (Fig. la) but both bands shifted upward and migrated as Cryj I under 
reducing condition (Fig. lb) when analyzed by silver-stained SDS-PAGE, This 

35 highly purified Cryy H still contained a small amount ("5%) of Cryj I as detected by 
Western blot using MAb CBF2, ^ch has been shown to bind to Cryj I and by N- 
terminal protem sequencing. This Cryj H preparation was used to generate primary 
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protein sequence of Cry jUas described below. 
Method B 

The dialyzed sample fix)m the ammonium sulfate precipitation was applied at 
5 1 ml/min to an 5.0 ml Q-Sepharose Econapac anion exchange cartridge (BioRad, 
Richmond, CA) equilibrated with 50 mM Na-acetate, pH 5.0 with protease 
inhibitors at 4^C. Ehition was performed with the above buffer containing 0.5 M 
NaCl. The basic unbound material was then applied to a 5.0 ml CM-Sepharose 
Econopac cation exchange cartridge (BioRad, Richmond, CA) equilibrated in 50 mM 

10 sodrnm acetate pH 5.0 with protease mhibitors. Basic protems were ehited with a 
linear gradient iq) to 0.1 M sodhmi phosphate pH 7.0, 0.3 M NaQ at 1 ml/min at 
4^C. A Oy 7 n -enriched peak was coUected late m the gradient arid fu^ 
purified by gel filtration chromatogr^hy. 

FPLC gel filtration was periformed using a 320 mL Superdex 75 26/60 

15 (Pharmacia, Piscataway, NJ) cohmm at 0.5 ml/min in 20 mM sodiwn acetate, pH 
5.0, in the presence of 0.15 MNaCl. The major peak contammg mosdy Oyy U 
eluted between 160 and 190 ml. Contaminating Cryj I was next removed by FPLC 
using a 1.0 ml Mono S 5/5 (Pharmacia, Piscataway, NJ) cation exchange column 
equilibrated with 10 mM sodium acetate pH 5.0. A stepwise gradient of 0-1 M 

20 NaCl was utilized by holdmg isocratically at 0.2 M, 0.3 M, 0,4 M and 1 M salt 
concentration. 

Multiple peaks (up to nine peaks) were obtamed (Fig. 2) and analyzed by 
silver stained SDS-PAGE under reducmg conditions (Fig. 3). Cry J I with a 
reported pi of 8.6-8.9 (Yasueda et al, /. AUergy Clin, Immunol, vol. 17 (1983)), 

25 eluted in the earlier peaks and displayed a molecular weight of about 40 kD. CryjU 
was purified to homogeneity as two bands (Fig. 3) and eluted in the later multiple 
peaks, suggesting the existemre of isoforms. ELISA analysis iising the mouse 
monoclonal 8B11 IgG antibody which was raised agamst biochraucaUy purified Cry 
j I confirmed the absence of Cry y I in these purified Cryj n preparation. This 

30 purified Cry j U was used m the human IgE reactivity studies (Example 6). 

Physical proper ties of Ov /II 

The physiochemical properties of Cryj U were studied and summarized as 
below. Under non-redudng SDS-PAGE conditions Cry y U consists of two bands 
35 with molecular weights ranged 34000-32000. The molecular weights of both bands 
are shifted higher to about 38-36 kD under redudng conditions (Fig. lb). This shift 
in SDS-polyacrylamide gel has also been observed by others (Sakaguchi et al. 
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AUergy45:309-312 (1990)). These results suggest that intra-disulfide bonds are 
probably present in the protein, and it is siqjported by the present findings that 
cloned Cry j Jl contains 20 cysteines deduced fix)m the nucleotide sequence (Example 
3). The pi of Cry j U estimated from lEF gel is about 10. The purified Cryj E 
binds human IgE of some allergic patients. 

The two molecular weight bands of Cryj 11 were separated on a 12% SDS- 
polyacrylamide gel and was then electroblotted onto PVDF membrane (Applied 
Biosystems, Foster City, CA). The blot was stamed with coomassie brilliant blue 
and was cut and subjected to N-terminal amino acid sequencing. (Example 2). The 
results showed that the upper and lower molecular weight bands had identical N- 
terminal sequences except the lower molecular weight band missed the first five 
amino acids. The estimated molecular weight of the \sppcr band based on the cDNA 
sequence is about 52,000, which is significantly higher than the molecular weight 
estimated from SDS-polyacrylamide gel either in the presence or absence of reducing 
reagent. It is also higher than that obtained from gel filtration and prelhninary mass 
spectroscopy analysis. These are several possibilities to account for this difference. 
One possibility is that Cry j U protein is processed. It is probable that the N- 
terminal and C-terminal of the protein are cleaved. It is not clear at the present time 
whether this processing occurs in the cell or due to proteolysis during purification 
even though fom: different protease inhibitors were added in most of the purification 
steps. Nevertheless, the two N-terminal sequences obtained from the purified Cryj 
n (Example 2) also contained the N-terminal sequence (10 amino acid) published by 
Sakaguchi et al (Allergy, 45:309-312(1990)) suggesting that the N-termmal of Cryj 
n is probably hydrolyzed. Since Sakaguchi et al. (supra) , did not use any protease 
inhibitors in their purification, a higher degree of hydrolysis might have occurred. 
This could explain why the N-terminal amino acid sequence that Sakaguchi et al. 
obtained was downstream of the N-terminal sequences as discussed in Example 2. 

Another ^roach which may be used to purify native Cry j U or recombmant 
Cry y n is inmnmoaffinity chromatogr^hy. This technique provides a very selective 
protem purification due to the specificity of the interaction between monoclonal 
antibodies and antigen. Murine polyclonal and monoclonal antibodies are generated 
against purified Cry j U, These antibodies are used for purification, 
characterization, analysis and diagnosis of the allergen Cryj H. 

Example 2 

Protein Sequencing of Purified Cryj U 

Cry j n protein was isolated as in Example 1. The doublet band shown on 
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SDS-PAGE (Fig. la) was electroblotted onto ProBlott (Applied Biosystems, Foster 
City, CA). Sequencing was performed with the Beckman/Porton Microsequencer 
(model LF3000, Beckman Instruments, Carlsbad, CA), a Programmable Solvent 
Module (Beckman System Gold Model 126, Beckman Instuments, Carlsbad, CA) 
5 and a Diode Array Detector Module for PTH-amino acid detection (Beckman System 
Gold Model 168, Beckman Instruments, Carlsbad, CA) following manufacturers 
specifications. 

A single N-terminal sequence analysis of the upper doublet band and multq>le 
N-terminal sequence analyses of the lower doublet band showed that both bands 

10 contained two N-termini, designated "long" and "short". The lower doublet band 
contained approximately 3.3 picomoles of the long form and 8.3 picomoles of the 
short form. This difference in yields was sufficient to make sequence assignments 
accoidii^ to the quantitation at each sequencer cycle. The ^pper doublet band 
contained approximately 8.3 picomoles of both sequences. The revealed long 

15 sequence was NH2-RKVEHSRHDAINnwnBKYGAVGIX}KH-DCT 

()()() KNP ( ) -COOH, (SEQ ID NO: 4) where (Q) indicates a tentative 
identification of glutamine at position 38 and 0 indicated unknown residues at 
positions 39-41 and 45. The revealed "short" sequence was NH2- 
SWTOAINffNVEKYGAVGDGKHDCTEAFSTAW^^ (SEQ ID NO: 5). 

20 Thus the long Cry j U sequence had five additional amino terminal residues than the 
short form and the sequence of the short form exactly matched that of the long form. 
In addition, both the long and short forms of Cry j Q contained the ten amino acids, 
NH2-AINIFNVEKY-COOH (SEQ ID NO: 6), previously described for Cryj H 
(Sakaguchi et al. 1990, supra) . The previously published ten amino acids 

25 (Sakaguchi et al. 1990, supra) correspond to amino acids ten through 19 of the long 
form described above. 

Examples 

Extraction of RNA From Japanese Cedar Pollen and Staminate Cones and 

30 Cloniny of Crv/n 

Fresh pollen and staminate cone samples, collected from a single 
Cryptomeria japoruca (Jq)anese Cedar) tree at the Arnold Arboretum (Boston, MA), 
were fix)zen immediately on dry ice. RNA was prepared fix)m 500 mg of each 
san5)le, essentially as described by Frankis and Mascariienas (1980) Aim. Bot. 45 : 

35 595-599. The san^)les were ground by mortar and pestle on dry ice and suspended 
in 5 ml of 50 mM Tris pH 9.0 with 0.2 M NaCl, 1 mM EDTA, 0.1% SDS that had 
been treated overnight with 0. 1 % diethyl pyrocarfoonate (DEPC). After five 
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extractions with phenol/chlorofonn/isoamyl alcohol (mixed 25:24:1), the RNA was 
precipitated firom the aqueous phase with 0. 1 volume 3M sodium acetate and 2 
volumes etfaanol. The pellets were recovered by centrifiigation, resuspended m 2 ml 
dH20 and heated to 65**C for 5 minutes* Two ml 4M lithium chloride was added to 
the preparation and the RNA was precipitated overnight at 0**C. The RNA pellets 
were recovered by centrifiigation, resuspended in 1 ml dH20, and again precipitated 
wiQi 3M sodiimi acetate and ethanol on dry ice for one hour. The final pellet was 
washed with 70% ethatK)l, air dried and resuspended in 100 ^1 DEPC-treated dH20 
and stored at -SO^'C. 

Double stranded cDNA was synthesized from 4 fig pollen RNA or 8 ^g 
flowerhead RNA usmg a commercially available kit (cDNA Synthesis System kit, 
BRL, Gaithersburg, MD). The double-stranded cDNA was phenol extracted, 
ethanol precipitated, blunted with T4 DNA polymerase (Promega, Madison, WI), 
and then ligated to ethanol precipitated, self annealed, AT and AL oligonucleotides 
for use in a modified Anchored PGR reaction, according to the m^od of Rafioar et 
aL (1990) J. Biol. Chem. 2^: 1229-1236 ; Frohman et al, (1990) Proc. Natl. 
Acad. Sci. USA 85: 8998-9002; and Roux et al. (1990) BioTech. 8: 48-57. 
Oligonucleotide AT has the sequence (SEQ ID NO: 10) 
5'-G<3GTCTAGAGGTACCG-TCCGTCCGATCGATCATT-3* (Rafnar^o/. 
supra) . Oligonucleotide AL has the sequence (SEQ ID NO: 11) 
5'-AATGATCGATGCT (Rafiiar et al, supra) . 

The first attempts at amplifying the amino terminus of Cryj U from the 
linkered cDNA (2 ^1 of a 20 /tl reaction) was made using the degenerate 
oligonucleotide CP-11 and oligonucleotide AP. CP-11 has the sequence (SEQ ID 
NO: 12) 5'-ATACITCTCIACGTTGAA-3' , wherein A at positon 1 can be G, C at 
position 4 can be T, C at position 7 can be T, I at position 10 is inosine to reduce 
degeneracy (Knoth et al. (1988) Nucleic Acids Res. 16: 10932), G at position 13 can 
be A, and G at position 16 can be A). AP, which has the sequence (SEQ ID NO: 
13) 5*-GGGTCTAGAGGTA-CCGTCCG-3', corresponds to nucleotides 1 through 
20 of the oligonucleotide AT. C3P-11 is the degenerate oligonucleotide sequence that 
is complementary to the coding strand sequence substantially encoding amino acids 
PheAsnValGluLysTyr (SEQ ID NO: 14) (amino acids 59 to 64 of Fig. 4), which 
correspond to the carboxy terminus of the previously published Cryj U sequence 
(Sakaguchi et al. , supra) shown in Fig. 4. All oligonucleotides were synthesized by 
Research Genetics Inc. , Huntsville, AL. 

Polymerase chain reactions (PGR) were carried out using a commercially 
available kit (GeneAmp DNA Amplification kit, Perkm Ehner CeUis, Norwalk, CT) 
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whereby 10 ^1 IQx buffer containing dNTPs was mixed with 100 pmoies of each 
oligonucleotide, cDNA (3-5 fil of a 20 first strand cDNA reaction mix), 0.5 /il 
Amplitaq DNA polymerase, and distilled water to 100 /il. 

The san^les were amplified with a programmable thermal controller (MJ 
5 Research, Inc., Cambridge, MA). The first 5 rounds of amplification consisted of 
denaturation at 94°C for 1 min, annealing of primers to the template at 45*'C for 1 
min, and chain elongation at 72**C for 1 min. The final 20 rounds of an5)lification 
consisted of denaturation as above, annealing at 55''C for 1 min, and elongation as 
above. The primary PGR reaction was carried out with 100 pmol each of the 

10 oligonucleotides AP and CP-1 1 . Five percent (5 /xl) of this mitial amplification was 
then used in a secondary an^lification with 100 pmoies each of AP and CP-12. CP- 
12 has the sequence (SEQ ID NO: 15) 5'-CCTGCAGTACTrCT- 
CIACGTTGAAIAT-3* , wherein C at position 10 can be T, C at position 13 can be 
T, I at positions 16 and 25 are inosines to reduce degeneracy as above, G at position 

15 19 can be A, and G at position 22 can be A. The sequence (SEQ ID NO: 16) 5'- 
CCTGCAG-3' (bases 1 through 7 of CP-12) represents a Pst I site added for 
cloning purposes; the remaining degenerate oligonucleotide sequence is 
complementary to the coding strand sequence that substantially encodes the amino 
acids nePheAsnValGhiLysTyr (SEQ ID NO: 17) (amino acids 58-64 of Fig. 4). 

20 Anoplified DNA was recovered by sequential chloroform, phenol, and chloroform 
extractions, foUowed by precipitation on dry ice with 0.5 volumes of 7.5M 
ammonium acetate and 1.5 volumes of isopropanol. After precipitation and washing 
with 70% eflianol, the DNA was simultaneously digested with Xba I and PstliaB. 
50 /il reaction, precipitated to reduce the volume to 10 fil, and electrophoresed 

25 through a preparative 2% GTG NuSeive low mek gel (FMC, Rockport, ME). The 
appropriate sized DNA area was visualized by ediidium bromide (EtBr) staining, 
excised, and ligated into appropriately digested pUC19 for sequencing by ttie 
dideoxy cham tenmnation method of Sanger et al. (1977) Proc. Natl. Acad. Sci. 
USA 74: 5463-5476) using a commercially available sequencing kit (Sequenase kit, 

30 U.S. Biochemicals, Cleveland, OH). All resultant clones were sequenced, and none 
were found to contain Cry j U sequence. An alternate 2° PGR reaction was 
performed with AP and the nested oligonucleotide CP-21. CP-21 has the sequence 
(SEQ ID NO: 18) 5'-CCTGCAGTACTTCTCIACGTTGAAGAT-3* wherem C at 
position 10 can be T, C at position 13 can be T, I at position 16 is inosine to reduce 

35 degeneracy as above, G at position 19 can be A, G at position 22 can be A, and G at 
position 25 can be A or T. The sequence (SEQ ID NO: 16) 5'-CCTGCAG-3* (bases 
1 through 7 of CP-21) represent a PjT I site added for cloning purposes; the 
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remaining degenerate oligonucleotide sequence is the non-coding strand sequence 
corresponding to coding strand sequence substantially encoding amino acids 
IlePheAsnValGluLysTyr (SEQ ID NO: 17) (amino acids 58 to 64 of Fig. 4). 

A primary PGR was also performed on double-stranded, linkered cDNA 
using CP-23D and AP, as above, to attempt to amplify the 3' end of the Cryj U 
cDNA. A secondary PGR was performed using 5 % of the primary reaction, usmg 
CP-24D and AP. CP-23D (sequence (SEQ ID NO: 19) 5*- 
GCIATTAATATnTTAA-3', wherein die T at position 6 can be C or A, T at 
position 9 can be C, T at position 12 can be C or A, and T at position 15 can be C ) 
is the coding strand sequence substantially encodmg amino acids AlalleAsnllePheAsn 
(SEQ ID NO: 20) (amino acids 55 to 60 of Fig. 4); CP-24D (SEQ ID NO: 21) 
(sequence 5'-GGAATTCCGCIATTAATATTTTTAATGT-3', wherein flie T at 
position 14 can be C or A. T at position 17 can be C, T at position 20 can be C or 
A. T at position 23 can be C, and T at position 26 can be C ) contains the sequence 
5*-GGAATTCC-3' (SEQ ID NO: 22) (bases 1 through 8 of CP-24), which 
represents an Eco RI site added for cloning purposes. The remaining degenerate 
oligonucleotide sequence of CP-24D substantially encodes amino acids 
AlalleAsnDePheAsnVal (SEQ ID NO: 23) (amino acids 55 to 61 of Fig. 4). Agam, 
multiple clones were sequenced, none of which could be identified as Cryj n, and 
this approach was not pursued further. 

Upon the characterization of novel Cry j U protein sequence data described in 
Example 2, new degenerate oligonucleotides for cloning Cryj U were designed and 
syndiesized. All oligonucleotides mentioned hereafter were synthesized on an ABI 
394 DNA/RNA Synthesizer (Applied Biosystems, Foster City, CA), and purified on 
NAP-10 cohnnns (Pharmacia, Uppsala, Sweden) as per the manufacturers* 
instructions. Degenerate oligonucleotide CP-35 was used with AP on the double- 
stranded linkered cDNA in a primary PGR reaction carried out as described herein. 
CP-35 has the sequence (SEQ ID NO: 24) 5'-GCTTCGGTACAATCATGTTT-3', 
wherein T at position 3 can also be C; G at position 6 can also be A, T or C; A at 
position 9 can also be G; A at position 12 can also be G; A at position 15 can be G; 
and T at position 18 can also be C; this degenerate oligonucleotide sequence is the 
non-coding strand sequence corresponding to coding strand sequence substantially 
encoding ammo acids LysHisAspCysThrGluAla of Cryj U (SEQ ID NO: 25) (amino 
acids 71 to 77 of Fig, 4). Five percent (5 /d) of diis initial amplification, designated 
JC136, was then used m a secondary amplification with 100 pmoles each of AP and 
degenerate Cry j U primer CP-36, an internally nested Cryj n oligonucleotide primer 
with die sequence (SEQ ID NO: 26) 5'- 
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GGCrrGCAGGTACAATCATGTTTGCCATC-3' wherein A at position 11 can also 
be G; A at position 14 can also be G; A at position 17 can also be G; T at position 20 
can also be C; G at position 23 can also be A, T, or C; and A at position 26 can also 
be G. The nucleotides 5*-GGCTGCAG-3' (SEQ ID NO: 27) (bases 1 through 8 of 
CP'36) represent siPstl restriction site added for cloning purposes. The remaining 
degenerate oligonucleotide sequence of CP-36 is the non-coding strand sequ^ice 
corresponding to coding strand sequence substantially encoding amino acids 
AspGIyLysHisAspCysThr of Cry j H (SEQ ID NO: 28) (amino acids 69 to 75 of Fig. 
4). The dominant amplified product, designated JC137, was a DNA band of 
approximately 265 base pairs, as visualized on an EtBr-stained 2% GTG agarose gel. 

AII^>lified DNA was recovered by sequential chloroform, pheiK)l, and 
chloroform extractions, followed by precipitation at -20**C with 0.5 volumes of 7.5 
ammonium acetate and 1.5 vohnnes of isopropanol. After precipitation and wasUng 
with 70% ethanol, the DNA was simultaneously digested with JOki I and I in a 
15 ^1 reaction and electrophoresed through a preparative 2% GTG SeaPlaque low 
melt gel (FMC, Roclq)ort, ME). The appropriate sized DNA band was visualized 
by EtBr staining, excised, and ligated into ^ropriately digested pUC19 for 
sequencing by the dideoxy chain termination method (Sanger et al. (1977) Proc. Natl 
Acad Sci. USA 74: 5463-5476) using a commercially available sequencing kit 
(Sequenase kit, U.S. Biochemicals, Cleveland, OH). 

The clones designated pUC19JC137a, pUC19JC137b, and pUC19JC137e 
were found to contain sequences encoding the amino terminus of Cry j U. All 
three clones had identical sequence in their regions of overlap, although all three 
clones had different lengths in the 5* untranslated region. Clone pUC19JC137b 
was the longest clone. The translated sequence of these clones had complete 
idratity to the disclosed 10 amino acid sequence of Cry jU (Sakaguchi et al., 
supra .)> as well as to ^ Cry j U amino acid sequeiK:e described in Example 2. 
Amino acid numbering is based on the sequence of the fidl length protein; amino 
acid 1 corresponds to the initiatmg methionine (Met) of CryJ H. The position of 
the initiating Met was supported by the presence of an i5)stream in-ftame-stop 
codon and by 78% homology of the surrounding nucleotide sequence with the 
plant consensus sequence that encompasses the initiating Met, as r^rted by 
Lutcke et al. (1987) EMBO 7. 6:43-48. 

The cDN A encoding the remainder of Cry j U gene was cloned from the 
linkered cDNA by using oligonucleotides CP-37 (SEQ ID NO: 29) (which has the 
sequence 5'-ATGTTGGACAGTGTrGTCGAA-3') and AP in a primary PCR, 
designated JC138ii. Oligonucleotide CP-37 corresponds to nucleotides 129 to 149 of 



wo 94/11512 



PCT/US93/11000 



26 

Fig. 4, and is based on the nucleotide sequence determined for the partial Cryj n 
clone pUC19JC137b. 

A secondary PCR reaction was performed on 5% of the initial amplification 
mixture, with 100 pmoles each of AP and CP-38 (SEQ ID NO: 30) (which has the 
5 sequence 5'-GGGAATTCAGAAAAGTTGAGCATTCTCGT-3'), the nested primer. 
The nucleotide sequence (SEQ ID NO: 31) 5*-GGGAATTC-3' (bases 1 through 8 of 
CP-38) represents an Eco RI restriction site added for cloning purposes. The 
remaining oligonucleotide sequence corresponds to nucleotides 177 to 197 of Fig. 4, 
and is based on the nucleotide sequence determined for the partial Cryj U clone 

10 pUC19JC137b. The amplified DNA product, designated JC140iii, was purified and 
precipitated as above, followed by digestion with Eco RI and Asp 718 and 
electrophoresis through a preparative 1 % low melt gel. The dominant DNA band, 
which was ^proximately 1.55 kb m length, was excised and ligated mto pUC19 for 
sequencing. DNA was sequemred by the dideoxy cham termination method (Sanger 

15 et al. supra) using a commercially available kit (sequenase kit (U.S. Biochemicals, 
Cleveland, OH). Both strands were conq)letely sequenced using M13 forward and 
reverse primers (N.E. Biolabs, Beverly, MA) and mtemal sequencing primers CP- 
35, CP-38, CP-40, CP-41, CP-42, CP-43, CP-44, CP-45, CP-46, CP-47, CP-48, 
CP-49,CP-50, and CP-51. CP-40 (SEQ ID NO: 32) has the sequence 5*- 

20 GTTCTTCAATGGGCCATGT-3 ' and corresponds to nucleotides 359 to 377 of Fig. 
4. CP-41 (SEQ ID NO: 33) has the sequence 5'- GTGTTAGGACT- 
GTCTCTCGG-3', which is the non-coding strand sequence that corresponds to 
nucleotides 720 to 739 of Fig. 4. CP-42 (SEQ ID NO: 35) has the sequence 
5*-TGTC;CAGGCCAT-GGAATAAG-3', which corresponds to nucleotides 864 to 

25 883 of Fig. 4 except that the first nucleotide was synthesized as a T rather than the 
correctG. CP-43 has the sequence (SEQ ID NO: 35) 5'- 

GCCTTACATGGACTGCAACC-3', which is the non-coding strand sequence that 
corresponds to nucleotides 1476 to 1495 of Fig. 4. CP-44 has the sequence (SEQ 
ID NO: 36) 5'-TCCACGGGTCTGATAATCCA-3\ which corresponds to 

30 nucleotides 612 to 631 of Fig. 4. CP-45 has the sequence (SEQ ID NO: 37) 

5'-AGGCAGGAAGCAATTTT-CCC-3' , which is the non-coding strand sequence 
that corresponds to nucleotides 1254 to 1273 of Fig. 4. CP-46 has the sequence 
(SEQ ID NO: 38) 5'-TACTGCACTTCAGCT-TCTGC-3', which corresponds to 
nucleotides 1077 to 1096 of Fig. 4. CP-47 has the sequence (SEQ ID NO: 39) 

35 5'-GGGGGTCTCCGAATTTATCA-3\ which is the non-coding strand sequence that 
substantially corresponds to nucleotides 1039 to 1058 of Fig. 4, except that the fifth 
nucleotide of CP-47 was synthesized as a G rather than the correct nucleotide, T. 
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C3*-48 (SEQ ID NO: 40), which has the sequence 5'- 

GGATATTTCAGTGGACACGT-S', corresponds to nucleotides 1290 to 1309 of 
Fig. 4. CP-49 (SEQ ID NO: 41) has the sequence 5'-TATTAGAAGACC- 
CTGTGCCT-3* , which is the non-coding strand sequence that corresponds to 
5 micleotides821 to840ofFig. 4, CP-50 (SEQ ID NO: 42) has the sequence 

5'-CCATGTAACKK:CAAGTTAGT-3', which corresponds to nucleotides 1485 to 
1504 of Fig. 4. CP-51 (SEQ ID NO: 43) has the sequence 

5'-ACACCnTACCCATTAGAGT-3', which is the non-coding strand sequence ttiat 
corresponds to nucleotides 486 to 505 of Fig. 4. 

10 Three clones, designated pUC19JC140iiia, pUC19JC140iud and 

pUC19JC140iiie, were subsequently found to contain partial Cryj U sequence. The 
sequence of clone pUC19JC140iiid was chosen as the consensus sequence since it 
had the longest 3' untranslated region. The sequences of pUC19JC140iiid and 
pUC19JC137b were used to construct the composite Cryj U sequence shown in Fig. 

15 4. In this composite, nucleotide 230 is reported as the A found in pUC19JC137b 
(also. pUC19JC137a, pUC19JC140iiia and pUC19JC140iiie) not as the G found in 
pUC19JC140iiid; however both A and G at nucleotide 230 encode Lys at amino acid 
63. The sequence of clone pUC19JC140iiia was identical to that of pUC19JC140iiid 
excq)t for the following: pUC19JC140iiia has a T at nucleotide 357 in place of a C 

20 (no predicted change in amino acid 106), has C at nucleotide 754 instead of T 
(changes ammo acid 238 firom De to Thr), C at nucleotide 1246 instead of T 
(changes amino acid 4XX1 firom Leu to Pro), and T at nucleotide 1672 uistead of C 
(untranslated region). The sequence of clone pUC19JC140iiie was identical to Aat 
of pUC19JC140iiid except for G at nucleotide 794 mstead of A (changes amino acid 

25 251 from He to Met), and T at nucleotide 357 in place of C (no predicted change in 
amino acid 106). 

An earlier attempt at cloning the JC140iii PGR product using znEco Bl/Xba 
I digest (oligonucleotide AP has hoihXbalsaidAspllS restriction enzyme sites) 
yielded cDNA that was cut in half due to an internal Xba I restriction site m the Cry 

30 j n cDNA, giving rise to 800 and 750 bp bands; the 750 bp band was succesfully 
cloned into Eco Bl/Xba I digested pUC19 and sequenced. Two 750 bp clones were 
sequenced and found to be flie 5' half of the Cryj U molecule: clones pUC19JC140- 
2a and pUC19JC140-2b. Clone pUC19JC140-2a has C for nucloeotide 297 instead 
of T (changes amino acid 86 from Cys to Arg) and clone pUC19JC140-2b has G for 

35 nucleotide 753 instead of A (changes amino acid 238 from lie to Val). Bofli clone 

pUC19JC140-2a and clone pUC19JC140-2b have a T at nucleotide 357 in place of C 
(no predicted change in amino acid 106). 
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Two different PGR amplifications were also sequenced directly to verify the 
clonal Cry j n sequence using the An^litaq Cycle Sequencing kit (Perkin Elmer 
Cetus, Norwalk, CT). This procedure involves the [32p]-end-labelling of 
oligonucleotide sequencing primers which are then aimealled (1 .6 pmoles in 1 ^1) to 
5 template DNA and elongated with dideoxy NTPs (methodology of Sanger et al. 

(1977) Proc. Natl Acad. Sci. USA 74:5463-5476) in a PGR reaction also containing 
4 m1 lOX Cycling Mix (contains 0.5 U//il Amplitaq DNA Polymerase), 5 ^1 template 
DNA (10-100 fimoles) and dH20 to 20 ^1 • The dGTP in the termination mixes in 
this kit have been replaced by 7-deaza-dGTP, which provides increased resohition of 

10 sequences containing high G+C regions of DNA. The tenq)late DNA was a PGR 
product that was recovered by sequential chloroform, phenol, and chloroform 
extractions, precipitated at -TXPC with 0.5 volumes of 7.5 ammomum acetate and 
1.5 volumes of isopropanol, then electrophoresed through a preparative 1 or 2% 
SeaPlaque low melt gel (FMG). Appropriate sized DNA bands were visualized by 

15 EtBr staining, excised, and treated witii Gelase (Epicentre Technologies, Madison, 
WI) to remove the agarose. The DNA was again precipitated, and resuspended m 
50 III TE (10 mM Tris, pH 7.4, 1 mM EDTA, pH 8.0) containing 20 |ig/ml RNAse 
(Boehringer Mannhenn, Indian^lis, IN). Two secondary an?)lifications which had 
been used to clone CryjH were repeated, and used as template DNA for PGR cycle 

20 sequencing: JG137ii, die 5* end PGR, (amplified from flie r PGR JG136 above) 
was reamplified with oligonucleotides AP and GP-36; and JG140ii, the 3* end PGR, 
(amplified ftom the 1** PGR JG138ii above) was reamplified witii oligonucleotides 
AP and GP-38. Both of the 1** anq>lifications used were precipitated, 
electrophoresed through a preparative 1 or 2% SeaPlaque low melt gel (FMG), and 

25 the appropriate sized bands were visualized by EtBr staining and excised. Two ^ of 
each 1** anq)lification was then used m the corresponding 2** PGR reaction. The 2** 
PGR product was then prepared as DNA ten5)late for PGR cycle sequencmg as 
described above. The oligonucleotides used as primers in PGR cycle sequencing, 
many of which were used to sequence die clones, are as follows: for JG137ii, GP-36 

30 and GP-39 (SEQ ID NO: 44), wbkii has flie sequence 5'- 

CTGTGGAAGATAATTTGGGC-3' and is flie non-coding strand sequence 
correspondmg to nucleotides 120 to 139 of Fig. 4. The oligonucleotide prinxers used 
for sequencmg JG140ii were GP-38, GP-40, CP-41, GP-42, CP-43, GP^, GP-45, 
CP-46, CP-47, GP-49, CP-50, CP-54 (SEQ ID NO: 45), which has the sequence 5'- 

35 GATGGCAGGGTGGTTGAGGC-3 ' , corresponds to nucleotides 985 to 1004 of Fig. 
4, GP-55 (SEQ ID NO: 46), which has tiie sequence 

5'-TAGGGGGATTTACGTGGAGG-3' and is the non-coding strand sequence that 
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corresponds to nucleotides 929 to 948 of Fig. 4, and CP-56 (SEQ ID NO: 47), 
wWch has the sequence 5'-TTGGGGTCGAGGCCTCCGAA-3' himI corresponds to 
nucleotides 1437 to 1456 of Fig. 4. The sequence of this full-length PGR cycle 
sequencing had only 2 nucleotide changes from the composite 
pUC19JC137b/pUC19JC140iiid Cry j U sequence shown in Figure 4, neither of 
which lead to an amino acid change. There was a T instead of C at nucleotide 357 
(no predicted change m amino acid 106), and a C instead of A at nucleotide 635 (no 
amino acid change). 

The nucleotide and predicted amino acid sequences of Cryj n are shown m 
Figs. 4 and 5. This is a conq)osite nucleotide sequence from the two overlapping 
clones pUC19JC137b and pUC19JC140iiid. Sequencing of multiple independent 
clones and cycle sequencmg of PGR product confirmed the nucleotide sequence of 
Figure 4. There were several nucleotide changes resultmg in predicted amino acid 
changes, as cited above. However, all nucleotide polymorphisms, with the 
exception of the T for G substitition at nucleotide 357, were only observed in smgle 
clones or sequencing reactions. Although T was seen at nucleotide 357 in aU clones 
except pUG19JG140iiid, both G and T encode Lai at amino acid 106. 

The con5)lete cDNA sequence for Oy y U is composed of 1726 nucleotides, 
including 41 nucleotides of 5' untranslated sequence, an open reading frame of 1542 
nucleotides starting with the codon for an initiatiag Met (nucleotides 42-44 of Fig. 
4), and a 143 bp 3* untranslated region. There is a consensus polyadenylation signal 
sequence in the 3* untranslated region 64 nucleotides 5' to the poly A tail 
(nucleotides 1654-1659 of Fig. 4). The position of tiie initiating Met is confirmed 
by the preseiK:e of an in-frame upstream stop codon and by 78% homology with the 
plant consensus sequence that eiK:onq>asses the mitiating Met (TAAAAUGGG (bases 
38 tiirough 46 of Fig. 4 (SEQ ID NO: 48)) found in Cryj U compared with the 
AAGAAUGGG (SEQ ID NO: 49) consensus sequence for plants, Lutcke et al. 
(1987) EMBO 7. 6: 43-48). The open readmg frame encodes a deduced protein of 
514 amino acids that has complete sequence identity witii the published partial 
protein sequence for Cry j II (Sakaguchi et al. supra) > which corresponds to amino 
acids55 through64 of Fig. 4. The predicted Cry/ H protein has 20 Gys, contains 
four potential N-linked glycosylation sites corresponding to the consensus sequence 
N-X-S/T, has a predicted molecular weight of 56.6 kDa and a predicted pl of 9.08. 

Detection of three separate NH2 termmi sequences for Cryj II (the long form 
and the short form as determined in Example 2 and the NH2 terminus determined by 
Sakaguchi et al., supra , as shown in Fig. 6) may suggest that the amino terminus of 
the mature Cry j n protein is blocked and that the sequences obtained by sequence 
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analysis of purified protein represent proteolytic cleavage products. As shown in 
Fig. 6, the amino acid sequence of the long form of Cryj n begins at amino acid 46 
and the amino acid ^quence of the short form of Cryj n begins at amino acid 51; 
and the NH2-tenninal sequence determed by Sakaguchi et al. begins at amino acid 
54. It is also possible fliat amino acids 1 to 45 represent the leader/pre-pro position 
of Cry j n that is enzymatically cleaved to give a functionally active protein 
beginnmg at amino acid 46 of Fig. 4. The sequences beginning at anuno acids 51 
and 54 represent breakdown products of the protein beginning at amino acid 46. 
There is a predicted cleavage site between amino acids 22 and 23 of Fig. 4 using the 
method of von Heijne (Nucleic Acids Res. (1986) 14:4683-4690). If the mature Cry 
j n protein started at amino acid 23 in Fig. 4, the protein would be 492 amino acids 
long with a predicted molecular weight of 54.2 kDa and a predicted pi of 9.0. 

Searching the Swiss-Prot data base with the Cryj n sequence demonstrated 
that Cryy n is 43.3% homologous (33.3% identical to polygalacturonase of tomato 
(lycopersicon esculentum) and 48.4% homologous (32.6% identical) to 
polygalacturonase of com, Zea mays. All nucleotide and amino acid sequence 
analyses were performed using PCGENE (Intelligenetics, Mountain View, CA.). 
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Extract! n f RNA from Japanese Cedar Fdllen Collected in Japan and 
Egression of Recombinant Cryj H 

Fresh poUen collected £rom a pool of Cryptomeria japordca (J^anese cedar) 
trees in Japan was firozen immediately on dry ice. RNA was prepared from 500 mg 
of tbe pollen, essentially as described by Frankis and Mascarenbas Am. BoL 45:595- 
599. Tbe samples were gromid by mortar and pesde on dry ice and suspended in 5 
mlof 50mMTrispH9.0wifli0.2MNaCl, ImMEDTA, 1 % SDS tbat had been 
treated overnight with 0. 1 % DEPC. After five extractions with phenol/chloroform 
/isoamyl alcohol (mixed at 25:24:1), the RNA was precipitated from the aqueous 
phase with 0.1 volume 3 M sodmm acetate and 2 volumes ethanol. The pellets were 
recovered by centrifugation, resuspended in 2 ml dH20 and heated to 65^C for 5 
minutes. Two ml of 4 M lithium chloride were added to the RNA preparations and 
they were mcubated overnight at O^C. The RNA pellets were recovered by 
centrifugation, resuspended in 1 ml dH20» and again precipitated with 3 M sodium 
acetate and ethanol overnight. The final pellets were resuspended in 100 ^1 dH20 
and stored at -80^C. 

Double stranded cDNA was synthesized from 8 /tg pollen RNA using the 
cDNA Synthesis Systems kit (BRL) with oligo dT priming according to the method 
of Gubler and HofGnan (1983) Gene 25:263-269. PCRs were carried out using the 
GeneAmp DNA Amplification kit (Perkm Elmer Cetus) whereby 10 /il lOx buffer 
containing dNTPs was nuxed with 100 pmol each of a sense oligonucleotide and an 
anti-sense oligonucleotide, cDNA (10 ^1 of a 400 ^1 double stranded cDNA reaction 
mix), 0.5 /d Amplitaq DNA polymerase, and distilled water to 100 ^il. 

The sanq>les were amplified with a programmable diermal controller from 
MJ Research, Inc. (Cambridge, MA). The first 5 rounds of amplification consisted 
of denaturation at 94^C for 1 min, antipgiing of primers to the template at 45^C for 
1 min, and chain elongation at 72^C for 1 min. The final 20 rounds of an^lification 
consisted of denaturation as above, annealing at 55^C for 1 min, and elongation as 
above. 

A new set of primer pairs was synthesized for amplification of a Cry j n 
cDNA from the mitiating Met to the stop codon. CP-52 (SEQ ID NO: 50) has the 
sequence 5'- GCCGAATTCATGGCCATGAAATTAATT-3' where the nucleotide 
sequence 5'-GCCGAATTC-3' (SEQ ID NO: 51) (bases 1 through 9 of CP-52 
represents an Eco RI restriction site added for cloning purposes, and the remaining 
sequence corresponds to nucleotides 42 to 59 of Fig. 4. CP-53 (SEQ ID NO: 52) 



wo 94/11512 



PCr/US93/11000 



32 

« 

has the sequence 5*-CGGGGATCCTCATTATGGATG-GTAGAT-3' where the 
nucleotide sequence 5'-CGGGGATCC-3' (SEQ ID NO: 53) (bases 1 through 9 of 
CP-53 represents a Bam HI restriction site added for cloning purposes, and the 
remauiing oligonucleotide sequence of CP-53 is complementary to coding strand 
5 sequence conesponding to nucleotides 1572 to 1589 of Fig. 4. The PGR reaction 
with CP-52 and CP-53 on the double stranded Japanese Cedar pollen cDNA yielded 
a band of approximately 1.55 Kb on an EtBr-stained agarose mmigel, and was called 
JC145, An5)lified DNA was recovered by sequential chloroform, phenol, and 
chloroform extractions, followed by precipitation at -20^C with 0.5 volumes of 7.5 

10 ammonium acetate and 1.5 volumes of isopropanol. After precipitation and washing 
with 70% ethanol. the DNA was simiiltaneously digested with Eco RI and Bam HI in 
a 15 III reaction, and electrophoresed through a preparative 1 % SeaPlaque low melt 
gel (FMC). Appropriate sized DNA bands were visualized by EtBr staining, 
excised, and ligated into qrp^riately digested pUC19 for sequencing by the 

15 dideoxy chain termination method (Sanger et al. (1977) Proc. Natl. Acad, ScL USA 
74:5463-5476) using a commercially available sequencmg kit (Sequenase kit, U.S. 
Biochemicals, Cleveland, OH). 

Clones pUC19JC145a and pUC19JC145b were completely sequenced usmg 
M13 forward and reverse primers (N.E. Biolabs, Beverly, MA) and internal 

20 sequencmgprimersCP-41, CP-42, CP-44, CP-46, andCP-51. The nucleotide and 
deduced amino acid sequences of clones pUC19JC145a and pUC19JC145b were 
identical to the Cry j n sequence of Fig. 4, with the foUowing exceptions. Clone 
pUC19JC145a was found to contain a single nucleotide difference from the 
previously known Cry j U sequence: it has a C at nucleotide position 1234 of Fig. 4 

25 rather than the previously described T. This nucleotide change results in a predicted 
ammo acid change from lie to Thr at amino acid 398 of the Cryj U protein. Clone 
pUC19JC145b has a G at nucleotide position 1088 of Fig. 4 rather than the 
previously described A, and an A for a G at nucleotide 1339. The nucleotide change 
at 1088 is silent and does not result in a predicted amino acid change. The 

30 nucleotide change at position 1339 results in a predicted amino acid change from Ser 
to Asn at amino acid 433 of the Cry j E protein. None of tiiese polymorphisms have 
yet been confirmed by independently-derived PCR clones or by direa amino acid 
sequencing and may be due to the inherent error rate of Taq polymerase 
(approximately 2 x 10-^, Saiki et al. (1988) Science 239:487-491). However, such 

35 polymorphisms in primary nucleotide and amino acid sequences are expected. 

E>q)ression of Cry J U was performed as follows. Ten /ig of pUC19JC145b 
was digested simultaneously with Eco RI and Bam HI. The nucleotide insert 
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encoding Oy j n (extending from nucleotide 42 through 1589 of Fig. 4) was 
isolated by electrophoresis of this digest through a 1 % SeaPlaque low melt agarose 
gel. The insert was then ligated into the 2q)propriately digested expression vector 
pET-lld (Novagen, Madison, WI; Jameel et al. (1990) /. Virol 64:3963-3966) 
5 modified to contain a sequence encoding 6 histidines (His 6) nnniediately 3' of the 
ATG initiation codon followed by a unique Eco RI endonuclease restriction site. A 
second Eco RI endonuclease restriction site m the vector, along with neighboring Cla 
I and flmd HI eirionuclease restriction sites, had previously been removed by 
digestion with £coRI and Hind m,bhmtmg and religation. The histidine (His6) 

10 sequence was added for affinity purification of die recombinant protein {Cryj I) on a 
Ni2+ chelating cohmm (HochuU et al. (1987) /. Chromatog. 411:177-184; Hochuli 
et al. (1988) Bio/Tech. 6:1321-1325.). A recombmant clone was used to transform 
Escherichia coU strain BL21-DE3, which harbors a plasmid that has an isopropyl-B- 
D-thiogalactopyranoside (IPTG)-mducible promoter precedmg the gene encodmg T7 

15 polymerase. Induction with IPTG leads to high levels of T7 polyinerase expression, 
which is necessary for expression of the recombmant protem m pET-lld. Qone 
pET-lldAHRhis6lC145b.a was confirmed to be a Cryj n clone m the correct 
readmg frame for expression by dideoxy sequencing (Sanger et al. stq)ra) with CP- 
39. 

20 E3q)ressionof fte recombinant protein was exainiiied in an iiiitial small 

culture. An overnight culture of clone pET-1 ldAHRhis6JC145b.a was used to 
innoculate 50 ml of media (Bram Heart Infusion Media, Difco) contammg ampicillm 
(2(X) ^ig/ml), grown to an A600 = 10 ^ titen mduced with IPTG (1 mM, final 
concentration) for 2 hrs. Od& ml aliquots of the bacteria were collected before and 

25 after mduction, pelleted by centrifugation, and crude cell lysates prepared by boiling 
thepeUetsfor5mfflutesin50mMTrisHCl.pH6.8,2mMEDTA, 1%SDS, 1% 
B-mercaptoedianol, 10% glycerol, 0.25% bromophenol blue (Studier et al., (1990) 
Methods in Enzymology 185:60-89). Recombinant protein expression was examined 
on a 12% Coomassie blue-stained SDS-PAGE gel, according to the method m 

30 Sambrook et al., sunra . on which 25 /il of the crude lysates were loaded. A negative 
control consisted of crude lysate from uninduced bacteria containmg flie plasmid 
with Cryj H. There was no notable increase in production of any recombinant E. 
coU protem m the range of 58 Kd, the size predicted for the recombmant Cry j H 
with the His6 leader. 

35 The pET-1 ldAHRhis6JC145b.a clone was then grown on a larger scale to 

examine if there was any recombinant protem being expressed. A 2 ml culture of 
bacteria contaming the recombinant plasmid was grown for 8 hr, then 3 ii\ was 
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spread onto each of 6 (100 x 15 mm) petri plates with 1.5% agarose in LB medium 
(Gibco-BRL, Gaithersburg, MD) containing 200 fig/xxH ampicillin, grown to 
confluence overnight, then scraped into 6 L of liquid media (Brain Heart Infusion 
media, Difco) containing ampicillin (200 iig/TsU). The culture was grown until the 
absorbance at A600 was 1.0, IPTG added (1 mM final concentration), and the 
culture grown for an additional 2 hours. 

Bacteria were recovered by centrifugation (7,930 xg, 10 min) and lysed in 50 
ml of 6M Guanidine-HCl, O.IM Na2HP04, pH 8.0, for 1 hour with vigorous 
shaking. Insoluble material was removed by centrifugation (11,000 xg, 10 min, 4° 
C). The pH of the lysate was adjusted to pH 8.0, and the lysate applied to a 50 ml 
Nickel NTA agarose cohmm ((Jiagen) that had been equilibrated with 6 M 
Guanidine HCl, 100 mM Na2HP04, pH 8.0. The colunm was sequentially washed 
with 6 M Guanidine HCl, 100 mM Na2HP04, 10 mM Tris-HCl, pH 8.0, then 8 M 
urea, 100 mM Na2HP04, pH 8.0, and finally 8 M urea, 100 mM sodium acetate, 
10 mM Tris-HCl, pH 6.3. The column was washed with each buffer until the flow 
through had an A280<- 0.05. 

The recombinant Cry j U protein was ehited with 8 M urea, 100 mM sodfaim 
acetate, 10 mM Tris-HCl, pH 4.5, and collected m 10 ml aliquots. The protem 
concentration of each firaction was determined by A280 and the peak fractions 
pooled. An aliquot of the collected recombinant protein was analyzed on SDS- 
PAGE according to the method in Sambrook et al. supra . 

This 6L prep, JCHpET-l, yielded 1.5 mg of recombinant Cry J H, which was 
resolved into 2 major bands on SDS-PAGE at 58 kDa and 24 kDa. The 58 kDa 
band, which represents recombinant Cryj n, was {proximately 9-10% of the total 
protein as determined by densitometry measurement (Shimadzu Flying Spot Scanner, 
Shimadzu Scientific Instruments, Inc., Braintree, MA). The 24 kDa band accounts 
for about 90% of the total protein and may represent a degradation product of the 
recombinant Cry y n or an E. coU contaminant. 

Another Cry j U expression construct was made by the ligation of the 
pUC19JC140iiid Cry j U insert into appropriately digested pETlldAHR (with the 6 
histidine leader). The vector was derived flx)m another pETlldAHR construct 
whose insert supplied an EcoR I site (at the 5' pETlldAHR-insert junction) and an 
Asp 718 site (at the 3* end of the insert); the construct was digested with these two 
enzymes, run on a low melt minigel as above, and the vector recovered as a band in 
low melt agarose. The pUC19JC140iiid construct was digested with Eco R I and 
Asp 718 to release the Cry j n insert, which was isolated on a low meh minigel and 
Ugated into the Eco R I/Asp 718 digested pETlldAHR vector prepared above. Five 
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clones were found to contain the correct nucleotide sequence at the insert/vector 5' 
junction, when sequenced by dideoxy sequencing (as above) with CP-39. This new 
construct, when e;q)ressed, would begin at amino acid 46 of Cryj n as shown in 
Figs. 4 and 5, This recombinant protein is designated rCry j U A46. A 50 ml small 
scale expression test (as performed above) showed that the e;q)ression level of rCry j 
n A46 from this construct, designated pETlldAHRJC140iiid2, would be much 
greater than die initial expression level from pETlldAHRJC145b2. A9Lprep, 
JCI^ET-3, was processed as above, and yielded 200 mg of xCryj U A46 at 80% 
purity as determined by densitometry of a Coomasie blue stained 12% SDS-PAGE 
gel. 

Examples 

Northern blot on RNA from Japanese Cedar PbU en Sources 

A northern blot analysis was perforaied on the RNA isolated firom J2q)anese 
Cedar pollen from both the Arnold Arboretum tree and the pooled trees firom Japan. 
Using essentially the method of Sambrook, supra > ten ng of RNA isolated from 
Japanese cedar pollen collected fix>m the Arnold Arboretum (Boston, MA) and 15 iig 
pooled RNA firom Japanese cedar pollen collected firom trees in Japan were nm on a 
1.2% agarose gel contaming 38% formaldehyde and IX MOPS (20X = 0.4M 
MOPS, 0.02M EDTA, O.IM NaOAc, pH 7.0) solution. The RNA samples (first 
precipitated with 1/10 volume sodium acetate, 2 volumes ethanol to reduce volume 
and resuspended in 5.5 /il dH20) were run with 10 itX fonnaldehyde/formamide 
buffer containing loading dyes with 15.5% formaldehyde, 42% formamide, and 
L3X MOPS solution, final concentration. The san^iles were transferred to 
Genescreen Plus (NEN Research Products, Boston, MA) by c^illary transfer m lOX 
SSC (20X = 3M NaCl, 0.3M Sodram Citrate), after which the membrane was 
baked 2 hrs at SO^C and UV irradiated for 3 minutes. Prehybridization of the 
membrane was at 60OC for 1 hour m 4 ml 0.5MNaPo4(pH 7.2), ImMEDTA, 1% 
BSA, and 7% SDS. The antisense probe was synthesized by asymmetric PGR on the 
JC145 amplification in low melt agarose (above), where 2 ^1 DNA is anqjlified with 
2 id dNTP mix (0. 167mM dATP, 0.167mM dTTP, 0. 167mM dGTP, and 0.033mM 
dCTP), 2 III lOX PCR buffer, 10 m1 ^^-dCTP (100 ^Ci; Amersham, Arlington 
Heights, n), 1 III (1(K) pmoles) antisense primer CP-53, 0.5 iil Taq polymerase, and 
dH20 to 20 /il; the lOX PCR buffer, dNTPs and Taq polymerase were from Perkm 
Ehner Cetus (Norwalk, CT), An^lification consisted of 30 rounds of denaturation 
at 94«>C for 45 sec, annealmg of primer to the template at 60^ for 45 sec, and chain 
elongation at 72 C for 1 min. The reaction was stopped by addition of 100 iil TE, 
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and the probe recovered over a 3cc G-50 spin cx)lumn (2 ml G-50 Sephadex 
[Pharmacia, Uppsala, Sweden] in a 3cc syringe plugged with glass wool, 
equilibrated with TE) and counted on a 1500 TriCarb Liquid Scintillation Counter 
(Packard, Downers Grove, IL). The probe was added to the prehybridizing buffer at 
10^ cpm/ml and hybridization was carried out at 60^C for 16 hrs. The blot was 
washed in high stringency conditions: 3x15 min at 65^0 with 0.2%SSC/1% SDS, 
followed by wrapping in plastic wrap and exposure to film at -80^C, A seven hoiu: 
exposmr of this Northern blot analysis revealed a smgle thick band at approximately 
1.7 kb for both RNA collected from the Arboretum tree and the RNA collected from 
the pooled trees from Japan. This message is the expected size for Cryj U as 
predicted by PGR analysis of the cDNA. 

Example 6 

Direct bind in g assay of TgK to Crv i 1. Crv I II and recMnb inant Crv i U. 

Coming assay plates (ijf25882-96) were coated with Cryj I or Cry J n at 2 
^ig/mL or recombinant Cry j U preparation at 10 ng/mL (proximately 20% pure) 
in a volume of 50 \xL overnight at 40C. The coating antigens were removed and the 
wells were blocked with 0.5% gelatin, PVP (polyvinyl pyrolidme) 1 mg/ mL in 
PBS, 200 liUweU for 2 hours at room temperature. The anti-Cry 7 I monoclonal 
antibody, 4B11, was serially dihited in PBS-Tween 20 starting at a 1:1000 dihition. 
The human plasma were serially diluted in PBS-Tween at a starting dilution of 1:2. 
For this set 23 plasma samples from patients symptomatic for Japanese cedar pollen 
allergy chosen for IgE binding analysis. The first antibody incubation proceeded 
overnight at 4^C, Following three washes with PBS-Tween the second antibodies 
were added (goat anti-mouse Ig or goat anti-human IgE both at 1:2000) and 
incubated for two boms at room t^nperature at 100 ^L/weU. This solution was 
removed and streptavidiinHRPO diluted to 1:10,000. was added at lOOfiL/well. The 
color was allowed to develop for 2-5 minutes. The reaction was stopped by the 
addition of lOOfiL/well of IM phosphoric acid. Plates were read on a Microplate 
IL310 Autoreader (Biotek Instrmnents, Winooski, VT) with a 450nm filter. The 
absorbance levels of di^licate wells were averaged. The graphed results Gog of the 
dilution vs. absorbance) of the ELISA assays are shown in Figs. 7 to 15. The 
summary of the results are given in Fig. 16. A positive binding result, indicated by 
a plus sign is determined to be a reading of two-fold or greater above background 
(no first antibody) at the second dilution of plasma (1:6). 

In Fig. 7 the binding response of the monoclonal antibody, 4B11, and seven 
patients' (Batch 1) plasma IgE is shown to purified Cry 7 I as the coating antigen. 
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The moncxrlonal antibody, raised against purified Cry j I shows a saturatmg level of 
bmding for die whole dilution series. The individual patient samples show a variable 
response of IgE binding to the Oy y I preparation. One patient, #1034, has no 
detectable binding to this protein preparation. All the patient samples were obtained 
5 ftom individuals claiming to be symptonmtic for Ji^anese cedar pollen allergy and 
the results of their MAST scores are shown in Fig. 16. Fig. 8 is a gr^h 
representing the binding of the same antibody set as in Fig. 7 to purified native Cryj 
n. The anti-Cry j I monoclonal antibody, 4B11, is negative on this preparation 
demonstradng lack of cross-reactiviQr between the two allergen antigens. In general, 

10 there is a lower overall re^)onse to this allergenic component of cedar pollen widi 

more patient sanaples showing decreased binding. However, patient #1034, that was 
negative on Cry j I shows very strong reactivity to Cryj H. In the last antigen set. 
Fig. 9, using recombinant CryjTL (jCry j II), monoclonal antibody 4B11 reactivity 
is negative and there is further reduction in binding of die human IgE samples 

15 compared to biochemically purified Cryy H. Two of the patients, #1143 and #1146, 
are clearly positive for IgE biixiing to the recombinant form of Cryy II although the 
patient that reacted the strongest to biochemically purified form is negative here, 
1034. Figs. 10-15 represent the application of the same antigen sets for the direct 
binding analysis of the next sixteen patients designated patient Batch 2 and patient 

20 Batch 3 m Figs. 10-15. 

The table shown m Fig. 16 summarizes both the MAST scores, performed in 
Japan on the plasma samples before shipment using a commercially available kit, and 
the direct ELISA results outlined above. Two patients were negative by the MAST 
assay, however, one of these patients, #1143, was positive on all the ELISA 

25 antigens. The number of positive responses for each antigen is shown and this 
represents a measure relative allergenicity of the different allergen prq)arations. 
These results demonstrate that Cry y n is an allergen as defined by human allergic 
patient IgE reactivity and that tfaere are some patients who are not reactive to Cryy I 
but are reactive to Cry j H. The ftequency of response in this population of patients 

30 is less to Cryy n than to Cryy I. 

E3cample7 

Ja panese Cedar PftHen Allergic Patient T Cdl Studies with Crv y H and Cry y D 
35 Peptides, 



Synthesis of Cryy n Peptides 
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Japanese cedar pollen Cry j II peptides designated Cry j HA Cry j IIB were 
synthesized using standard Fmoc/tBoc synthetic chemistry and purified by Reverse 
Phase HPLC. The amino acid sequence of peptide Cry J HA is FTFKVDGIIAAYQ 
(SEQ ID NO: 54) which corresponds to amino acids 116-128 as shown if Figs 4 and 
5 ' 5. The amino acid sequence of peptide Cry j IIB is NGYFSGHVIPACKN (SEQ ID 
NO: 55) which corresponds to amino acids 416-429 as shown in Figs 4 and 5. The 
peptide names are consistent throughout. 

T Cell Responses to Japanese Cedar PoUen Antigen Peptides 

10 Peripheral blood mononuclear cells (PBMC) were purified by lymphocyte 

separation medium (LSM) centrifugation of 60 ml of heparinized blood from one 
Japanese cedar pollen-allergic patient who exhibited clinical symptoms of seasonal 
rhinitis and was MAST and/or skin test positive for Japanese cedar pollen. Long 
term T cell Imes were established by stimulation of 2 X 10^ PBL/ml in bulk cultures 

15 of complete medram (RPMI-1640, 2 mM L-ghitamine, 100 U/ml 

penicillin/streptomycin, 5x10-5M 2-mercaptoethanoU and 10 mM HEPES 
si^lemented with 5% heat inactivated human AB serum) with 10 jig/ml of partially 
purified native Cry j n for 7 days at 37^0 in a humidified 5% CO2 incubator to 
select for Cry j U reactive T cells. This amount of priming antigm was determined 

20 to be optunal for the activation of T cells from most Japanese cedar pollen allergic 
patients. Viable cells were purified by LSM centrifugation and cultured in complete 
medium siq>plemented with 5 units recombinant human IL-2/nil and 5 units 
recombinant human IL-4/ml for up to three weeks until the cells no longer responded 
to lymphokines and were considered "rested". The ability of the T cells to 

25 proliferate to peptides Cry j IIA and Cry j IIB, recombinant Cryj H {rCryj II), 

purified native Cry j H, or purified native Cry j I was then assessed. For assay, 2 X 
10* rested cells were restimulated in the presence of 2 X 10* autologous Epstem- 
Barr virus (EBV)-transformed B cells (prepared as described below) (gamma- 
irradiated with 25,000 RADS) with 2-50 tig/ml of rCryj H, purified native Cryj 

30 n, peptides Cry j HA and Cry j IIB, of purified native Cryj I, in a volume of 200 jil 
con^lete medmm m duplicate or triplicate wells in 96-well round bottom plates for 
2-4 days. The optimal incubation was found to be 3 days. Each well then received 
1 ^lCi tritiated thymidine for 16-20 hours. The counts incorporated were coDected 
omo glass fiber filter mats and processed for liquid scmtillation couritmg. The 

35 mftyiTnuTTi response m a titration of each peptide is ejqiressed as the stimulation nidex 
(S.I.). The S.L is the counts per minute (CPM) incorporated by cells m response to 
peptide, divided by the CPM mcorporated by cells in medmm only. An S.I. value 
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equal to or greater than 2 times the background level is considered "positive" and 
indicates that the peptide contains a T cell epitope. The results of this assay 
indicated that peptides Cr J n, and Cry j HB did noit appear to contain a T cell 
epitope for this particular allergenic patient. However, additional Japanese cedar 
5 pollen allergic patients will be tested in this assay system and one or both of these 
pq)tides may contain T cell epitopes for other allergic individuals. 
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Preparation f (EBV)-traiisfonned B Cells for Use as Antigen Presenting Cells 

Autologous EBV-transformed cell lines were y-irradiated with 25,000 Rad 
and used as antigen presenting cells in secondary proliferation assays and secondary 
bulk stimulations. These cell lines were also used as a control in the innnuno- 
5 fluorescence flow cytometry analysis. These EBV-transformed ceU lines were made 
by incubatmg 5 X 10^ PEL with 1 ml of B-59/8 Marmoset ceU line (ATCC 
CRL1612, American Type Culture Collection, Rockville, MD) conditioned medium 
m die presence of 1 jig/ml phorbol 12-myristate 13-acetate (PMA) at ST^C for 60 
minutes in 12 X 75 mm polypropylene round-bottom Falcon snap cap tubes (Becton 

10 Dickinson Labware, Lincoln Park, NJ). These cells were then dihited to 1.25 X 10^ 
cells/ml m RPMI-1640 as described above except supplemented with 10% heat- 
inactivated fetal bovine senun and cultured in 200 ^1 aliquots in flat bottom culture 
plates until visible colonies were detected. They were then transferred to larger 
wells until the cell lines were established. 

15 Although the invention has been described with reference to its preferred 

embodiments, other embodiments, can achieve the same results. Variations and 
modifications to the present invention will be obvious to those skilled in the art and it 
is intended to cover in the appended claims all such modification and equivalents and 
follow in the true spirit and scope of this invention. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: 

(A) NAME: IMMOLOGIC PHARMACEUTICAL CORPORATION 

(B) STREET: 610 Lincoln Street 

(C) CITY: Waltham 
10 (D) STATE: MA 

(E) COUNTRY: XJSA 

(F) POSTTOi CODE (ZIP) : 02154 

(G) TELEPHONE: (617) 466-6000 

(H) TELEFAX: (617)466-6040 

(ii) TITLE OF INVENTION: Allergenic Proteins and Peptides From 

Japsmese Cedar Pollen 

(iii) NUMBER OF SEQUENCES: 55 

20 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-TOS 
25 (D) SOFTWARE: ASCII (TEXT) 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

30 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

35 (viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Vanstone, Darlene 

(B) REGISTRATION NUMBER: 35,729 

(C) REFERENCE/DOCKET NUMBER: IPC-033PC 

40 (ix) TELECCflflMDNICATION INFORMATION: 

(A) TELEPHONE: (617) 466-6000 

(B) TELEFAX: (617) 466-6040 

45 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1726 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDHDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

55 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 42..15B6 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

5 TGAGTTCGAG ACAAGTATAG AAAGTiATTTT CTTTTATTAA A ATG GCC ATG AAA 

53 

Met Ala Met Lys 
1 

10 TTA ATT GCT CCA ATG GCC TTT CTG GCC ATG CAA TTG ATT ATA ATG GCG 

XOl 

Leu lie Ala Pro Met Ala Phe Leu Ala Met Gin Leu lie lie Met Ala 
5 10 15 20 

15 GCA GCA GAA GAT CAA TCT GCC CAA ATT ATG TTG GAC AGT GTT GTC GAA 

149 

Ala Ala Glu Asp Gin Ser Ala Gin lie Met Leu Hsp Ser Val Val Glu 
25 30 35 

20 T^AA TAT CTT AGA TCG AAT CGG AGT TTA AGA AAA GTT GAG CAT TCT CGT 

197 

Lys Tyr Leu Arg Ser Asn Arg Ser I*eu Arg Lys Val Glu His Ser Arg 
40 45 50 

25 CAT GAT GCT ATC AAC ATC TTC AAT GTG GAA AAG TAT GGC GCA GTA GGC 

245 

His Asp Ala lie Asn lie Phe Asn Val Glu Lys Tyr Gly Ala Val Gly 
55 60 65 

30 GAT GGA AAG CAT GAT TGC ACT GAG GCA TTT TCA ACA GCA TGG CAA GCT 

293 

Asp Gly Lys His Asp Cys Thr Glu Ala Phe Ser Thr Ala Trp Gin Ala 
70 75 80 

35 GCA TGC AAA AAC CCA TCA GCA ATG TTG CTT GTG CCA GGC AGC AAG AAA 

341 

Ala Cys Lys Asn Pro Ser Ala Met Leu Leu Val Pro Gly Ser Lys Lys 
85 90 95 100 

40 TTT GTT GTA AAC AAT CTG TTC TTC AAT GGG CCA TGT CAA CCT CAC TTT 

389 

Phe Val Val Asn Asn Leu Phe Phe Asn Gly Pro Cys Gin Pro His Phe 
105 110 115 

45 ACT TTT AAG GTA GAT GGG ATA ATA GCT GCG TAC CAA AAT CCA GCG AGC 

437 

Thr Phe Lys Val Asp Gly lie lie Ala Ala Tyr Gin Asn Pro Ala Ser 
120 125 130 

50 TGG A7U3 AAT AAT AGA ATA TGG TTG CAG TTT GCT AAA CTT ACA GGT TTT 

485 

Trp Lys Asn Asn Arg lie Trp Leu Gin Phe Ala Lys Leu Thr Gly Phe 
135 140 145 

55 ACT CTA ATG GGT AAA GGT GTA ATT GAT GGG CAA GGA AAA CAA TGG TGG 

533 

Thr Leu Met Gly Lys Gly Val lie Asp Gly Gin Gly Lys Gin Trp Trp 
150 155 160 



60 



65 



GCT GGC CAA TGT AAA TGG GTC AAT GGA CGA GAA ATT TGC AAC GAT CGT 
581 

Ala Gly Gin Cys Lys Trp Val Asn Gly Arg Glu lie Cys Asn Asp Arg 
165 170 175 180 

GAT AGA CCA ACA GCC ATT AAA TTC GAT TTT TCC ACG GGT CTG ATA ATC 
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Asp Arg Pro Thr Ala lie Lys Phe Asp Phe Ser Thr Gly Leu lie lie 
185 190 195 

5 CAA GGA CTG AAA CTA ATG AAC ACT CCC GAA TTT CAT TTA GTT TTT GGG 

677 

Gin Gly Leu Lys Leu Met Asn Ser Pro Glu Phe His Leu Val Phe Gly 
200 205 210 

10 AAT TGT GAG GGA GTA AAA ATC ATC GGC ATT AGT ATT ACG GCA CCG AGA 

725 

Asn Cys Glu Gly Val Lys lie lie Gly lie Ser lie Thr Ala Pro Arg 
215 220 225 

15 GAC AGT CCT AAC ACT GAT GGA ATT GAT ATC TTT GCA TCT AAA AAC TTT 

773 

Asp Ser Pro Asn Thr Asp Gly He Asp He Phe Ala Ser Lys Asn Phe 
230 235 240 

20 CAC TTA CAA AAG AAC ACG ATA GGA ACA GGG GAT GAC TGC GTC GCT ATA 

821 

His Leu Gin Lys Asn Thr He Gly Thr Gly Asp Asp Cys Val Ala He 
245 250 255 260 

25 GGC ACA GGG TCT TCT AAT ATT GTG ATT GAG GAT CTG ATT TGC GGT CCA 

869 

Gly Thr Gly Ser Ser Asn He Val He Glu Asp Leu He Cys Gly Pro 
265 270 275 

30 GGC CAT GGA ATA AGT ATA GGA ACT CTT GGG AGG GAA AAC TCT AGA GCA 

917 

Gly His Gly He Ser He Gly Ser Leu Gly Arg Glu Asn Ser Arg Ala 
280 285 290 

35 GAG GTT TCA TAC GTG CAC CTA AAT GGG GCT AAA TTC ATA GAC ACA CAA 

965 

Glu Val Ser Tyr Val His Val Asn Gly Ala Lys Phe He Asp Thr Gin 
295 300 305 

40 AAT GGA TTA AGA ATC AAA ACA TGG CAG GCT GGT TCA GGC ATG GCA AGC 

1013 

Asn Gly Leu Arg He Lys Thr Trp Gin Gly Gly Ser Gly Met Ala Ser 
310 315 320 

45 CAT ATA ATT TAT GAG AAT GTT GAA ATG ATA AAT TCG GAG AAC CCC ATA 

1061 

His He He Tyr Glu Asn Val Glu Met He Asn Ser Glu Asn Pro He 
325 330 335 340 

50 TTA ATA AAT CAA TTC TAC TGC ACT TCA GCT TCT GCT TGC CAA AAC CAG 

1109 

Leu He Asn Gin Phe Tyr Cys Thr Ser Ala Ser Ala Cys Gin Asn Gin 
345 350 355 

55 AGG TCT GCG GTT CAA ATC CAA GAT GTG ACA TAC AAG AAC ATA CGT GGG 

1157 

Arg Ser Ala Val Gin He Gin Asp Val Thr Tyr Lys Asn He Arg Gly 
360 365 370 

60 ACA TCA GCA ACA GCA GCA GCA ATT CAA CTT AAG TGC ACT GAC AGT ATG 

1205 

Thr Ser Ala Thr Ala Ala Ala He Gin Leu Lys Cys Ser Asp Ser Met 
375 380 385 

65 CCC TGC AAA GAT ATA AAG CTA ACT GAT ATA TCT TTG AAG CTT ACC TCA 

1253 
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Pro Cys Lys Asp lie Lys Leu Ser Asp lie Ser Leu Lys Leu Thr Ser 
390 395 400 

GGG AAA ATT GCT TCC TGC CTT AAT GAT AAT GCA AAT GGA TAT TTC AGT 
1301 

Gly Lys lie Ala Ser Cys Leu Asn Asp Asn Ala Asn Gly Tyr Phe Ser 
405 410 415 420 

GGA CAC GTC ATC CCT GCA TGC AAG AAT TTA AGT CCA AGT GCT AAG CGA 
1349 

Gly His Val lie Pro Ala Cys Lys Asn Leu Ser Pro Ser Ala Lys Arg 
425 430 435 

AAA GAA TCT AAA TCC CAT AAA CAC CCA AAA ACT GTA ATG GTT GAA AAT 
1397 

Lys Glu Ser Lys Ser His Lys His Pro Lys Thr Val Met Val Glu Asn 
440 445 450 

ATG CGA GCA TAT GAC AAG GGT AAC AGA ACA CGC ATA TTG TTG GGG TCG 
1445 

Met Arg Ala Tyr Asp Lys Gly Asn Arg Thr Arg lie Leu Leu Gly Ser 
455 460 465 

AGG CCT CCG AAT TGT ACA AAC AAA TGT CAT GGT TGC AGT CCA TGT AAG 
1493 

Arg Pro Pro Asn Cys Thr Asn Lys Cys His Gly Cys Ser Pro Cys Lys 
470 475 480 

GCC AAG TTA GTT ATT GTT CAT CGT ATT ATG CCG CAG GAG TAT TAT CCT 
1541 

Ala Lys Leu Val lie Val His Arg lie Met Pro Gin Glu Tyr Tyr Pro 
485 490 495 500 

CAG AGG TGG ATA TGC AGC TGT CAT GGC AAA ATC TAC CAT CCA TAATGAGATA 
1593 

Gin Arg Trp lie Cys Ser Cys His Gly Lys lie Tyr His Pro 
505 510 

CATTGAAACT GTATGTGCTA GTGAATATTC TTGTGGTACA ATATTAGAAC TGATATTGAA 
1653 

AATAAATCAT CAATGTTTCT AAGGCATTTA TAATAGATTA TATTAATGGT TCAGCCTGGT 
1713 

GCAAAAAAAA AAA 
1726 



(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Met Lys Leu lie Ala Pro Met Ala Phe Leu Ala Met Gin Leu 
15 10 15 

lie He Met Ala Ala Ala Glu Asp Gin Ser Ala Gin tie Met Leu Asp 
20 25 30 
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Ser Val Val Glu Lys Tyr Leu Arg Ser Asn Arg Ser Leu Arg Lys Val 
35 40 45 

Glu His Ser Arg His Asp Ala lie Asn lie Phe Asn Val Glu Lys Tyr 
50 55 60 

Gly Ala Val Gly Asp Gly Lys His Asp Cys Thr Glu Ala Phe Ser Thr 
65 70 75 80 

Ala Trp Gin Ala Ala Cys Lys Asn Pro Ser Ala Met I*eu Leu Val Pro 
85 90 95 

Gly Ser Lys Lys Phe Val Val Asn Asn Leu Phe Phe Asn Gly Pro Cys 
100 105 110 

Gin Pro His Phe Thr Phe Lys Val Asp Gly lie lie Ala Ala Tyr Gin 
115 120 125 

Asn Pro Ala Ser Trp Lys Asn Asn Arg He Trp Leu Gin Phe Ala Lys 
130 135 140 

Leu Thr Gly Phe Thr Leu Met Gly Lys Gly Val He Asp Gly Gin Gly 
145 150 155 160 

Lys Gin Trp Trp Ala Gly Gin Cys Lys Trp Val Asn Gly Arg Glu He 
165 170 175 

Cys Asn Asp Arg Asp Arg Pro Thr Ala He Lys Phe Asp Phe Ser Thr 
180 185 190 

Gly Leu He He Gin Gly Leu Lys Leu Met Asn Ser Pro Glu Phe His 
195 200 205 

Leu Val Phe Gly Asn Cys Glu Gly Val Lys He He Gly He Ser He 
210 215 220 

Thr Ala Pro Arg Asp Ser Pro Asn Thr Asp Gly He Asp He Phe Ala 
225 230 235 240 

Ser Lys Asn Phe His Leu Gin Lys Asn Thr He Gly Thr Gly Asp Asp 
245 250 255 

Cys Val Jlla He Gly Thr Gly Ser Ser Asn He Vail He Glu Asp Leu 
260 265 270 

He Cys Gly Pro Gly His Gly He Ser He Gly Ser Leu Gly Arg Glu 
275 280 285 

Asn Ser Arg Ala Glu Val Ser Tyr Val His Val Asn Gly Ala Lys Phe 
290 295 300 

He Asp Thr Gin Asn Gly I*eu Arg He Lys Thr Trp Gin Gly Gly Ser 
305 310 315 320 

Gly Met Ala Ser His He He Tyr Glu Asn Val Glu Met He Asn Ser 
325 330 335 

Glu Asn Pro He Leu He Asn Gin Phe Tyr Cys Thr Ser Ala Ser Ala 
340 345 350 

Cys Gin Asn Gin Arg Ser Ala Val Gin He Gin Asp Val Thr Tyr Lys 
355 360 365 

Asn He Arg Gly Thr Ser Ala Thr Ala Ala Ala He Gin Leu Lys Cys 
370 375 380 
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Ser Asp Ser Met Pro Cys Lys Asp lie Lys Leu Ser Asp He Ser Leu 

385 390 395 400 

Lys Leu Thr Ser Gly Lys He Ala Ser Cys I^eu Asn Asp Asn Ala Asn 
405 410 415 

Gly Tyr Phe Ser Gly His Val He Pro Ala Cys Lys Asn Leu Ser Pro 
420 425 430 

Ser Ala Lys Arg Lys Glu Ser Lys Ser His Lys His Pro Lys Thr Val 
435 440 445 

Met Val Glu Asn Met Arg Ala Tyr Asp Lys Gly Asn Arg Thr Arg He 
450 455 460 

Leu Leu Gly Ser Arg Pro Pro Asn Cys Thr Asn Lys Cys His Gly Cys 
465 470 475 480 

Ser Pro Cys Lys Ala Lys Leu Val He Val His Arg He Met Pro Gin 
485 490 495 

Glu Tyr Tyr Pro Gin Arg Trp He Cys Ser Cys His Gly Lys He Tyr 
500 505 510 

His Pro 



(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

Arg Lys Val Glu His Ser Arg His Asp Ala He Asn He Phe Asn Val 
1 5 10 15 

Glu Lys Tyr Gly Ala Val Gly Asp Gly Lys His Asp Cys Thr Glu Ala 
20 25 30 

Phe Ser Thr Ala Trp Gin Ala Ala Cys Lys Asn Pro Ser 
35 40 45 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) XiBNGTH: 41 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCSIIPTION: SEQ ID NO:4: 

Arg LyB Val Glu His Ser Arg His Asp Ala lie Asn lie Phe Asn Val 
5 15 10 15 

Glu Lys Tyr Gly Ala Val Gly Asp Gly Lys His Asp Cys Thr Glu Ala 
20 25 30 

10 Phe Ser Thr Ala Trp Gin Lys Asn Pro 

35 40 

(2) INFORMATION FOR SEQ ID N0:5: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



25 



30 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Ser Arg His Asp Ala lie Asn He Phe Asn Val Glu Lys Tyr Gly Ala 
15 10 15 

Val Gly Asp Gly Lys His Asp Cys Thr Glu Ala Phe Ser Thr Ala Trp 
20 25 30 



Gin Lys Asn Pro 
35 35 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Ala He Asn He Phe Asn Val Glu Lys Tyr 
15 10 



55 (2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1410 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



65 
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(xi) SEQX3ENCE DESCRIPTION: SEQ ID NO: 7: 

AGAAAAGTTG AGCATTCTCG TCATGATGCT ATCAACATCT TCAATGTGGA AAAGTATGGC 
60 

5 

GCAGTAGGCG ATGGAAAGCIA TGATTGCACT GAGGCATTTT CAACAGCATG GCAAGCTGCA 
120 

TGCAAAAACC CATCAGCAAT GTTGCTTGTG CCAGGCAGCA AGAAATTTGT TGTAAACAAT 
10 180 

CTGTTCTTCA ATGGGCCATG TCAACCTCAC TTTACTTTTA AGGTAGATGG GATAATAGCT 
240 

15 G06TACCAAA ATCCAGCGAG CTGQAAGAAT AATAGAATAT GGTTGCAGTT TGCTAAACTT 

300 

ACAGGTTTTA CTCTAATGGG TAAAGGTGTA ATTGATGGGC AAGGAAAACA ATGGTGGGCT 
360 

20 

GGCCAATGTA AATGGGTCAA TGGACGAGAA ATTTGCAACG ATCGTGATAG ACCAACA6CC 
420 

ATTAAATTCG ATTTTTCCAC GGGTCTGATA ATCCAAGGAC T6AAACTAAT GAACAGTCCC 
25 4 8 0 

GAATTTCATT TAGTTTTTGG GAATTGTGAG GGAGTAAAAA TCATCGGCAT TAGTATTACG 
540 

30 GCACCGAGAG ACAGTCCTAA CACT6ATGGA ATTGATATCT TTGCATCTAA AAACTTTCAC 

600 

TTACAAAAGA ACACGATAGG AACAGGGGAT GACTGCGTCG CTATAGGCAC AGGGTCTTCT 
660 

35 

AATATTGTGA TTGAGGATCT GATTTGCGGT CCAGGCCATG GAATAAGTAT AGGAAGTCTT 
720 

GGGAGGGAAA ACTCTAGAGC AGAGGTTTCA TACGTGCACG TAAATGGGGC TAAATTCATA 
40 780 

GACACACAAA ATGGATTAAG AATCAAAACA TGGCAGGGTG GTTCAGGCAT GGCAAGCCAT 
840 

45 ATAATTTAT6 AGAATGTTGA AATGATAAAT TCGGAGAACC CCATATTAAT AAATCAATTC 

900 

TACTGCACTT CAGCTTCTGC TTGCCAAAAC CAGAGGTCTG CGGTTCAAAT CCAAGATGT6 
960 

50 

ACATACA7U3A ACATACGTGG GACATCAGCA ACAGCAGCAG CAATTCAACT TAAGTGCAGT 
1020 

GACAGTATGC CCTGCAAAGA TATAAAGCTA AGTGATATAT CTTTGAAGCT TACCTCAGGG 
55 1080 

AAAATTGCTT CCTGCCTTAA TGATAATGCA AATGGATATT TCAGTGGACA CGTCATCCCT 
1140 

60 GCATGCAAGA ATTTAAGTCC AA6TGCTAAG CGAAAAQAAT CTAAATCCCA TAAACACCCA 

1200 

AAAACT6TAA TGGTTGAAAA TATGCGAGCA TATGACAAGG GTAACAGAAC ACGCATATTG 
1260 

65 

TTGGGGTCGA GGCCTCCGAA TTGTACAAAC AAATGTCATG GTTGCAGTCC ATGTAAGGCC 
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1320 



AAGTTAGTTA TTGTTCATCG TATTATGCCG CAGGAGTATT ATCCTCAGAG GTGGATATGC 
1380 

AGCTGTCATG GCAAAATCTA CCATCCATAA 
1410 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECOLE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

TCTCQTCATG ATGCTATCAA CATCTTCAAT GTGGAAAAGT ATGGCGCAGT AGGCGATGGA, 
60 

AA6CATGATT GCACTGAGGC ATTTTCAACA GCATGGCAAG CTGCATGCAA AAACCCATCA 
120 

GCAATGTTGC TTGTGCCAGG CAGCAAGAAA TTTGTTGTAA ACAATCTGTT CTTCAATGGG 
180 

CCATGTCAAC CTCACTTTAC TTTTAAGGTA GATGGGATAA TAGCTGCGTA CCAAAATCCA 
240 

GCGAGCTGGA AGAATAATAG AATATGGTTG CAGTTTGCTA AACTTACAGG TTTTACTCTA 
300 

ATGGGTAAAG GTGTAATTGA TGGGCAAGGA AAACAATGGT GGGCTGGCCA ATGTAAATGG 
360 

GTCAATGGAC GAGAAATTTG CAACGATCGT GATAGACCAA CAGCCATTAA ATTCGATTTT 
420 

TCCACGGGTC TGATAATCCA AGGACTOAAA CTAATGAACA GTCCCGAATT TCATTTAGTT 
480 

TTTGGQAATT QTGAGGQAGT AAAAATCATC GGCATTAGTA TTACGGCACC GAGAGACAGT 
540 

CCTTJ^CACTG ATGGAATTGA TATCTTT6CA TCTAAAAACT TTCACTTACA AAAGAACACG 
600 

ATAGGAACAG GGGATGACTG CGTCGCTATA GGCACAGGGT CTTCTAATAT TGTGATTGAG 
660 

GATCTGATTT GCGGTCCAGG CCATGGAATA AGTATAGGAA GTCTTGG6AG GGAAT^ACTCT 
720 

AGAGCAGAGG TTTCATACGT GCAC6TAAAT GGGGCTAAAT TCATAGACAC ACAAAATGGA 
780 

TTA7W3AATCA AAACATGGCA GGGTGGTTCA GGCATGGCAA GCCATATAAT TTATGAGAAT 
840 
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GTTGAAATGA TAAATTCGGA GAACCCCATA TTAATAAATC AATTCTACTG CACTTCAGCT * 
900 

TCTGCTTGCC AAAACCAGAG GTCTGCGGTT CAAATCCAAG ATGTGACATA CAAGAACATA 
960 

CGTGGGACTIT CAGCAACAGC AfiCAGCAATT CAACTTAAGT GCT^TGACAG TATGCCCTGC 
1020 

AAAGATATAA AGCTAAGTGA TATATCTTTG AAGCTTACCT CAGGGAAAAT TGCTTCCTGC 
1080 

CTTAATGATA ATGCAAATGG ATATTTCAGT GGACACGTCA TCCCTGCATG CAAGAATTTA 
1140 

AGTCCAAGTG CTAAGCGAAA AGAATCTAAA TCCCATAAAC ACCCAAAAAC TGTAATGGTT 
1200 

GAAAATATGC QAGCATATGA CAAGGGTAAC AGAACACGCA TATTGTTGGG GTCGAGGCCT 
1260 

CCGAATTGTA CAAACAAATG TCATGGTTGC AGTCCATGTA AGGCCAAGTT AGTTATTGTT 
1320 

CATCGTATTA TGCCGCAGGA GTATTATCCT CAGAGGTGGA TATGCAGCTG TCATGGCAAA 
1380 

ATCTACCATC CATAA 
1395 



(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GAAGATCAAT CT6CCCAAAT TATGTTGGAC AGTGTTGTCG AAAAATATCT TAGATCGAAT 
60 

CGGAGTTTAA GAAAAGTTGA GCATTCTCGT CATGATGCTA TCAACATCTT CAATGTGGAA 
120 

AA6TATGGCG CAGTAGGCGA TGGAAAGCAT GATTGCACTG AGGCATTTTC AACAGCATGG 
180 

CAAGCTGCAT GCAAAAACCC ATCAGCAATG TTGCTTGTGC CAGGCAGCAA GAAATTTGTT 
240 

GTAAACAATC TGTTCTTCAA TGGGCCATGT CAACCTCACT TTACTTTTAA GGTAGATGGG 
300 

ATAATAGCTG CGTACCAAAA TCCAGCGAGC TGGAAGAATA ATAGAATATG GTTGCAGTTT 
360 

6CTAAACTTA CAGGTTTTAC TCTAATGGGT AAAGGTGTAA TTGATGGGCA AGGAAAACAA 
420 
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TGGTGGGCTG GCCAATGTAA ATGGGTCAAT GGACGAGAAA TTTGCAACGA TCGTGATAGA * 
480 

CCAACAGCCA TTAAATTCGA TTTTTCCACG GGTCTGATAA TCCAAGGACT GAAACTAATG 
540 

AACAGTCCCG AATTTCATTT AGTTTTTGGG AATTGTGAGG GAGTAAAAAT CATCX3GCATT 
600 

AGTATTACGG CACCGAGAGA CAGTCCTAAC ACTGATGGAA TTGATATCTT T6CATCTAAA 
660 

AACTTTCACT TACAAAAGAA CACGATAGGA ACAGGGGATG ACTGCGTCGC TATAG6CACA 
720 

GGGTCTTCTA ATATTGTGAT TGAGGATCTG ATTTGCGGTC CAGGCCAT6G AATAAGTATA 
780 

GGAAGTCTTG GGAGGGAAAA CTCTAGAGCA GAGGTTTCAT ACGTGCACGT AAATGGGGCT 
840 

AAATTCATAG ACACACAAAA TGGATTAAGA ATCAAAACAT GGCAGGGTGG TTCAGGCATG 
900 

GCAAGCCATA TAATTTATGA GAATGTTGAA ATGATAAATT CGGAGAACCC CATATTAATA 
960 

AATCAATTCT ACTGCACTTC AGCTTCTGCT TGCCAAAACC AGAGGTCTGC GGTTCAAATC 
1020 

CAAGATGTGA CATACAAGAA CATAC6TGGG ACATCAGCAA CAGCAGCAGC AATTCAACTT 
1080 

AAGTGCAGTG ACAGTATGCC CTGCAAAGAT ATAAAGCTAA GTGATATATC TTTGAAGCTT 
1140 

ACCTCAGGGA AAATTGCTTC CTGCCTTAAT GATAATGCAA ATGGATATTT CAGTGGACAC 
1200 

GTCATCCCTG CATGCA7VGAA TTTAAGTCCA AGTGCTAAGC GAAAAGAATC TAAATCCCAT 
1260 

AAACACCCAA AAACTGTAAT GGTTGATUUIT ATGCQAGCAT ATGACAAGGG TAACAGAACA 
1320 

CGCATATTGT TGGGGTCGAG 6CCTCCGAAT TGXACAAACA AATGTCATGG TTGCA6TCCA 
1380 

TGTAAGGCCA AGTTAGTTAT TGTTCATCGT ATTATGCCX3C AGGAGTATTA TCCTCAGAGG 
1440 

TGGATAT6CA GCTGTCATGG CAA/IATCTAC CATCCATAA 
1479 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGGTCTAGAG GTACCGTCCG TCCGATCGAT CCATT 
35 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQTJENCE DESCRIPTION: SEQ ID NO: 11: 

AATGATCGAT GCT 
13 

(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

RTAYTTYTCN ACRTTRAA 
18 

(2) INFORMATION FOR SEQ ID N0:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IiENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GGGTCTAGAG 6TA 
13 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTB: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(v) FRAGMENT TYPE: internal 



(Xi) SEQXJENCE DESCRIPTION: SEQ ID NO: 14: 

Phe Asn Val Glu Lys Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CCTGCAGTAY TTYTCNACRT TRAANAT 
27 

(2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



CCTGCAG 
7 



(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 7 amino acids 
(B) TYPE: amino acid 
(D) TOPOIiOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

lie Phe Asn Val Glu Lys Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 18: 



10 



15 



25 



45 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : single 
(D) TOPOIiOGY: linear 

(ii) MOIiECUIiE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CCTGCAGTAY TTYTCNACRT TRAADAT 
27 

(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIiOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



30 GCNATHAAYA THTTYAA 

17 



35 (2) INFORMATION FOR SEQ ID N0:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

50 Ala lie Asn lie Phe Asn 

1 5 

(2) INFORMATION FOR SEQ ID NO: 21: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
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(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 21: 
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GGAATTCCX3C NATHAAYATH TTYAAYGT 
28 

(2) INFORMATION FOR SEQ ID N0:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQXJENCE DESCRIPTION: SEQ ID NO:22; 



GGAATTCC 
8 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRA04ENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Ala lie Asn lie Phe Asn Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

GCYTCNGTRC ARTCRTGYTT 
20 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



10 



25 



30 



40 



45 
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(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Lys His Asp Cys Thr Glu Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STOANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26; 

GGCTGCAGGT RCARTCRTGY TTNCCRTC 
28 

(2) INFORMATION FOR SEQ ID NO:27: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
3i5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



GGCTGCAG 
8 



(2) INFORMATION FOR SEQ ID NO:28: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

55 (v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Asp Gly Lys His Asp Cys Thr 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 29: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) IiENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOIiECULE TYPE: cPHA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

ATGTTGOACA GTGTTGTCGA A 
21 

(2) INFORMATION FOR SEQ ID NO: 30; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

GGGAATTCAG AAAAGTTGAG CATTCTCGT 
29 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

GGGAATTC 
8 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
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GTTCTTCAAT GGGCCATGT 
19 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GTGTTAGGAC TGTCTCTCGG 
20 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

TGTCCAGGCC AT6GAATA7U3 
20 



(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECDX-E TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GCCTTACATG GACTGCAACC 
20 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECOLE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

TCCACGGGTC TGATAATCCA 
20 

(2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

AGGCAGGAAG CAATTTTCCC 
20 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

TACTGCACTT CAGCTTCTGC 
20 

(2) INFORMATION FOR SEQ ID N0:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEN{3TH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

GGGGGTCTCC GAATTTATCA 
20 

(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

GGATATTTCA GTGGACACGT 
20 



(2) INFORMATION FOR SEQ ID NO: 41: 



(i) SEQXJENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

30 TATTAGAAGA CCCTGCGCCT 

20 

(2) INFORMATION FOR SEQ ID NO:42: 

35 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 

(ii) MOLECDLE TYPE: cDNA 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

CCATGTAAGG CCAAGTTAGT 
20 

50 (2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) X«EN6TH: 20 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



ACACCTTTAC CCATTAGAGT 
65 2 0 
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10 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQX3ENCE CHARACTERISTICS: 

(A) IiENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



CTGTCCAACA TAATTTGGGC 
20 



20 



25 



30 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDKA 



35 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

CATGGCAGGG TGGTTCAGGC 
20 

(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



TAGCCCCATT TACGTGCACG 
20 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

TTGGGGTCGA GGCCTCCGAA 
20 

(2) INF0RM;ITI0N for SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

TAAAAUGGC 
9 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IiENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

AACAAUGGC 
9 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) XiENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

GCCGAATTCA TGGCCATGAA ATTAATT 
27 

(2) INFORMATION FOR SEQ ID NO:51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

GCC6AATTC 
10 9 



(2) INFORMATION FOR SEQ ID NO: 52: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDKESS : single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: cDNA 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

CGGGGATCCT CATTATGGAT GGTAGAT 
27 

30 

(2) INFORMATION FOR SEQ ID NO:53: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

45 

CGGGGATCC 
9 



50 



(2) INFORMATION FOR SEQ ID NO:54: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 
55 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

60 (v) FRAGMENT TYPE: internal 



65 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
Phe Thr Phe Lys Val Asp Gly He He Ala Ala Tyr Gin 
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(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOUSCOLE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Asn Gly Tyr Phe Ser Gly His Val He Pro Ala Cys Lys Asn 
15 10 
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1. A nucleic acid having a nucleotide sequence coding for a Japanese Cedar 
pollen allergen Cry j n, or at least one antigenic fragment thereof, or the 
functional equivalent of said nucleotide sequence. 

2. A nucleic acid of claim 1 wherein said nucleotide sequence consists 
essentially of at least one fragment of the coding portion of the nucleotide 
sequence of Fig. 4 (SEQ ID NO: 1). 

3 . A nucleic acid of claim 2 wherein said fragment comprises bases 108 through 
1586 (SEQ ID NO: 9) of the nucleotide sequence of Fig. 4 (SEQ ID NO: 1). 

4. A nucleic acid of claim 1 wherem said nucleotide sequence consists 
essentially of the nucleotide sequence of Fig, 4 (SEQ ID NO: 1). 

5. A nucleic acid of claim 1 wherein said fragment comprises bases selected 
from the group consisting of bases 177 through 1586 (SEQ ID NO: 7) of tiie 
nucleotide sequence of Fig. 4, and bases 192 through 1586 (SEQ ID NO: 8) 
of die nucleotide sequence of Fig. 4 (SEQ ID NO: 1). 

6. An e)q)ression vector conq}rising a nucleotide sequence coding for a Japanese 
cedar pollen allergen Cry j H, or at least one antigenic fragment thereof, or 
the functional equivalent of said nucleotide sequence. 

7. An expression vector of claim 6 wherein said nucleotide sequence consists 
essentially of at least one fragment of the coding portion of the nucleotide 
sequence of Fig. 4 (SEQ ID NO: 1). 

8. An expression vector of claim 6 wherem said nucleotide sequence comprises 
bases 108 through 1586 (SEQ ID NO: 9) of the nucleotide sequence of Fig. 
4. 

9. A host cell transformed to express a protein or pq)tide encoded by the nucleic 
acid of claim 1. 

10. Isolated Cry j U protein, or at least one antigenic fragment thereof, produced 
in a host cell transformed with the nucleic acid of clafan 1. 
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An antigenic fragment of claim 10 which does not bind immunoglobulin E 
specific for a Japanese cedar pollen allergen, or if binding of said antigenic 
fragment to said inununoglobulin E occurs, such binding does not result in 
hiRtaniine release from mast cells or basophils. 

An antigenic fragment of claim 10 which binds immimoglobulin E to a 
substantially lesser extent than purified, native Cryj n protein binds said 
immunoglobulin E. 

Isolated Cry j U protein of claim 10 wherein the host cell is E.coli . 

A method of producing Cry j JL protein, or at least one fragment thereof, 
comprising the steps of: 

a. culturing a host cell transformed widi a DNA sequence encoding Cryj 
n protein or fragment thereof, in an appropriate medfami to produce a 
mixture of cells and medium containing Cry j U protein or at least one 
fragment thereof; ami 

b, purifying said mixture to produce substantially pure Cry j H protein, 
or at least one fragment thereof. 

A protein preparation comprising Cry j II protein, or at least one fragment 
thereof, synthesized in a host cell transformed with a nucleic acid conqnismg 
a nucleotide sequence encoding all or a portion of Cryj n. 

A protein preparation of claim 15 wherein said at least one fragment of Cryj 
n is an antigenic fragment. 

A protein preparation con^rising chemically synthesized Cryj n protein, or 
at least one fragment thereof. 

A protein preparation of claim 15 wherein said Cryj U protein comprises an 
amino acid sequence shown in Fig. 4 (SEQ ID NO: 2). 

A protein preparation of claim 17 wherein said Cry j n protein comprises an 
amino acid sequence shown in Fig. 4 (SEQ ID NO: 2). 
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20. An isolated peptide comprising at least one T cell epitope of Cryj n. 

21 . An isolated peptide of claim 20 which as minimal immunoglobulin E 
stimulating activity. 

22. An isolated peptide of claim 20 which does not bind immunoglobulin E 
specific for a Japanese cedar pollen allergen, or if binding of the peptide to 
said immunoglobulin E occurs^ such binding does not result in histamine 
release from mast cells or basophils. 

23. An isolated peptide of claim 20 which binds immunoglobulin E to a 
substantially lesser extent than purified native Cryj TL protem binds said 
immunoglobulin E. 

24. Isolated Cry j n protein, or an antigenic fragment thereof, which modifies, m . 
an individual sensitive to Japai^se cedar pollen to whom it is administered, 
the allergic response of the individual to a Japanese cedar pollen allergen. 

25 . Isolated Cry j n protein or antigenic firagment of claim 24 which modifies B- 
cell response of the individual to a Js^anese cedar pollen allergen, T-cell 
response of the individual to a Japanese cedar pollen allergen, or both the B- 
cell response and the T-cell response of the mdividual to a Japanese cedar 
pollen allergen. 
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Modified Cry j E protein or at least one modified Augment thereof, which 
when administered to an individual sensitive to Japanese cedar pollen, 
reduces the allergic response of the individual to Cry j H. 

A therapeutic composition comprising isolated Cry j TL protein, or at least one 
fragment thereof, and a pharmaceutically acceptable carrier or diluent. 

A therapeutic composition of claim 27 wherein said Cry J U protein 
comprises an amino acid sequence shown in Fig. 4 (SEQ ID NO: 1). 

A method of treating sensitivity to a Japanese cedar pollen allergen, or an 
allergen immxmologically cross-reactive with a Japanese cedar poUen 
allergen, in an individual sensitive to said allergen, comprising administering 
to the individual a therap^itically effective amount of the conq^osition of * 
claim 27. 

A method of detecting sensitivity in an individual to a Japanese cedar pollen 
allergen, conq)rising combining a blood ^mple obtained fix)m the individual 
with isolated Cry j U protein, or antigenic fragment diereof , produced in a 
host cell transformed with the nucleic acid of claim I or chemically 
synthesized, under conditions appropriate for binding of blood components 
with the protein or fragment thereof, and determining the extent to which 
such binding occurs. 

A method of claim 30 wherein the extent to which binding occurs is 
determined by assessing T cell function, T cell proliferation, B cell function, 
binding of the protein or firagment thereof to antibodies present in the blood 
or a combination thereof. 

A monoclonal antibody, polyclonal antibody or immunoreactive fragment 
thereof, specifically reactive with Cry j 11 protein, or at least one antigenic 
fragment thereof. 

Cry J n protein isolated firom Japanese cedar pollen, said proteia having a 
molecular weight of about 40 kD as determined by sodium dodecyl sulfate- 
polyacrylamide gel electrophoresis. 
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34. A host cell transfonned with a vector containing the cDNA insert of Cry j H, 
said host cell having ATCC deposit number 691QS. 



35, 

5 



A recombinant DN A molecule comprising a DNA coding for a polypeptide 
having at least one epitope of the protein alleigen. Cry j n. 
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TAAAATGGCCATGAAATTA.-.'PTG 
K A K K L I 



70 80 90 iOO ilO 120 

I I I ! I ' 

CTCCAATGGCCTTTCTCGCCATGCAATTGATTATAftTGGCGGCAGCAGAAGATCAATCTC 

APMAFLAMQLIIMAAAEDQS 
10 20 

130 140 150 160 170 180 

I I .1 I I . ' 

CCCAAATTATGTTGGACAGTGTTGTCGAAAAATATCTTAGATCGAATCGGAGTTTAAGAA 

AQIMLDSVVEK YLRSNRSLR 
30 40 

190 200 210 220 230 240 

I I I I I ' 

AAGTTGAGCATTCTCGTCATGATGCTATCyACATCTTCaATGTGGAAAAGTATGG^ 

KV E HSRHDA INIFNVEKYGA 
50 60 

250 260 270 280 290 300 

1 I I I I > 

TAGGCGATGGAAAGCATGRTTGCACTGAGGCy^TTTTCAACAGO^TGGCAAGCTGCATC 

VGDGKHDCTEAFSTAWQAAC 
70 80 

310 320 330 340 350 360 

I I I I I I 

AAAJVCCCATOVGCAATGTTGCTTGTGCCAGGCAGCAAGAAATTTGTTGTAAAC^ 
KNPSAM LLVPGSKKFVVNNL 

90 100 

370 380 390 400 410 420 

I I.I I I I 

TCTTC7^TGGGCCATGTCAACCK»CTTTACTTTTAAGGTAGATGGGATAAT^ 

FFNGPCQPH FTFK VDG I lAA 

110 120 

430 440 450 460 470 480 

I I 1 I I I 

ACCAAAATCCAGCGAGCTGGAAGAATAATAGAATATGGTTGCAGTTTGCTAAACTTACAG 

Y Q N P A S W K N N R I W L Q F A K L T 
130 140 
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1090 1100 1110 1120 1130 .1140 

I I I I I ' 

GCACTTCAGCTTCTCCTTGCCAAAACCAGAGGTCTGCGGTTCAAATCCAAGATGTGACAT 

CTSASACQWQRSAVQIQDVT 
350 360 

1150 1160 1170 1180 1190 1200 " 

I I I I ' ' 

ACJ^GAACATACGTGGGACATCAGCAACAGCAGCyiGCAATTCyACTTAAGT^ 

YKNIRG TS ATAAAIQLKCSD 
370 380 

1210 1220 1230 1240 1250 1260 

GTATCCCCTCCAAAGATATLAGCTAAGTaATATATCTTTGAAGCT^^^^ 
SMPC KD IKLSDISLKLTSGK 

390 400 

1270 1280 " 1290 1300 1310 1320 

I I I I I ' 

TTGCTTCCTGCCTTAATGATAATGCAAATGGATATTTCAGTGGACJVCGTXyiTCCCT^^ 

lASCLNDNANGYFSGHVI PA 
410 . 420 . 

1330 1340.. 1350 1360 1370 1380 

I I I I ' ' 

GCAAGAATTTAAGTCCAAGTGCTAAGCGAAAAGAATCTAAATCCCATAAACACCCAAAAA 

CKNLS PSAKRKESKSHKHPK 
430 . 



1390 1400 1410 1420 1430 1440 

I I I I I ' 

CTGTAATGGTTGAAAATATCCGAGCATATGACAAGGGTAACAGAACACGCATATTGTTGG 

TVMVENMRAYDKGNRTRILL 
450 460 
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— --A-A^j:..CAAAT37CATGG-r:-GCA3TCCATGT;^AG:iCCAAG': 

- - V C H G C S ■? C K A }' 
4fC 



2510 1520 1530 1540 1550 1560 

TAGTTATTC'l«rcATCGTO4ATGCC<;CAGGAGTATTATCCTCAGAG^ 
LVIVHRlMPQEYYPQRvJ .lCS 

490 500 



1570 1580 1590 IbUU aoxu 

GTCATGGCAi^TCTACCATCCATAATGAGATACATTGAAACTGTATGTGCTAGTGAA^^ 

CHGKIYHP- 
510 514 

1630 1640 1650 1660 1670 1680 

TTCTTCTCGTACAATATTAGAACTCATATTGAAAATAAATCATCAATC 

1690 1700 1710 1720 

I I I I 

TTATAATAGATTATATTAATGGTTCAGCCTGGTGCAAAAAAAAAAA-3 ' 
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.1.1 

SO tO '0 80 

Sl^.£LRK^.^H£RHDAiNaFfr7EKyGAVGDGKH]XTEAFST 

90 100 110 ■ 120 

lit) 
AWOAACKNPSAMLiVPGSKKF\^n«^LFFNGPCOPHFTFKV 

130 140 150 160 

I il I I 

IX3I lAAYQNPASWKJJNRIVa/QFAJXTCFTU^OT 

170 180 190 200 

I I I I 

KQWWAGQaCVATNGREICNDRDRPTAIKFDFSTGLIIQGLK 

210 220 230 240 

I ■ I I 1 

U^NSPEFHLVFGNCEGVKI IGISITAPRDSPNTDGIDIFA 

250 260 270 280 

lilt 
SKNFHLQKNTIGTGDDCVAIGTGSSNIVIEDLICGPGHGI 

290 3D0 310 320 

; I.I I 

SIGSLGRENSRAEVSY^/HVNGAKFlDTQNGLRIKTlN-QGGS 

330 340 350 360 

I I I 1 

GMASHIIYENVEMINSENPILINQFYCTSASACQNQRSAV 

370 380 390 400 

1 I I I 

QIQDVTYKNIRGTSATAAAIQLKCSDSMPCKDIKLSDISL 

410 420 430 440 

I I I I 

KLTSGKIASCLNDNAM5YFSGHVIPACKNLSPSAKRKESK 
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